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Abstract 


Minimum  distance  estimators  are  compared  to  the  maximum 
likelihood  estimator  of  the  location  and  scale  parameters 

.  J 

o-f  the  two  parameter  Cauchy  distribution. .  Samples  sizes  of 

<  o 

6,8,10,12,  and  16  are  randomly  drawn  from  a  Cauchy  distribu¬ 
tion  and  used  to  estimate  the  parameters  by  the  maximum 
likelihood  method.  Minimum  distance  techniques  are  then 
used  to  try  improve  upon  the  maximum  likelihood  estimates. 
1000  samples  for  each  sample  size  are  generated  and  the  mean 
squared  error  and  relative  efficiencies  are  computed.  Com¬ 
parison  of  the  two  methods  is  done  by  comparing  the  relative 


efficiencies.  Both  censored  and  non— censored  samples  are 

/ 

used  to  compute  thejninimum  distance  estimates.  / 


COMPARISON  OF  ESTIMATION  TECHNIQUES  FOR 


THE  TWO  PARAMETER  CAUCHY  DISTRIBUTION 

I .  Introduction 

Background 

Military  decision  makers  have  increasingly  been  using 
information  obtained  from  statistical  analysis  in  the  deci¬ 
sion  making  process.  This  information,  which  can  include 
description  of  data,  interpretation  of  data,  and  point  and 
confidence  interval  estimation  (7:93),  allows  decision  mak¬ 
ers  to  be  better  informed  and  to  make  better  decisions.  Two 
areas  in  which  military  decision  makers  have  extensively 
used  statistical  analysis  include  the  reliability  of  weapons 
systems  and  simulation  of  military  systems  and  scenarios. 

For  example,  consider  the  computer  simulation  of  a 
nuclear  war  between  the  Soviet  Union  and  the  United  States. 
This  situation  could  be  modeled  on  a  computer  using  a  simu¬ 
lation  language  such  as  SLAM.  Statistical  analysis  could  be 
used  to  estimate  the  mean  distance  of  a  missile  warhead  from 
a  target  point.  This  estimate  could  be  derived  from  a  ser — 
ies  of  test  flights  where  the  distances  between  impact 
points  and  target  points  are  recorded.  This  estimation 
could  then  be  used  as  a  parameter  in  a  probability  distri¬ 
bution  which  generates  individual  warheads  at  various  impact 
points  over  a  period  of  time.  Other  distributions  such  as 
the  probability  of  detonation  of  a  particular  warhead  could 


be  derived  in  a  similar  manner  using  statistical  analysis. 
These  and  other  -factors  could  then  be  used  to  simulate  a 
real  world  nuclear  engagement.  From  this  simulation,  infer — 
ences  could  be  drawn  about  a  real  world  nuclear  engagement 
and  decision  makers  would  be  better  in-formed  to  make  deci¬ 
sions  concerning  weapon  types,  numbers,  deployment,  etc. 
Clearly,  the  more  accurate  the  parameter  estimation,  the 
more  reliable  the  simulation,  and  hense  the  better  the  in¬ 
formation  available  to  the  decision  maker.  Parameter  esti¬ 
mation  is  therefore  a  critical  step  in  the  decision  making 
process. 

The  beginning  of  parameter  estimation  began  in  the  early 
1800s  when  Legrende  and  Gauss  independently  derived  the 
method  of  least  squares.  Lengrende  used  the  method  to  esti¬ 
mate  the  orbital  parameters  of  comets  and  as  this  author 
well  knows,  the  method  is  still  used  today  to  estimate  the 
orbital  parameters  of  artificial  satellites.  Gauss  gave  the 
probabilistic  basis  for  the  method  and  worked  out  the  compu¬ 
tational  techniques  (22:14).  The  first  major  advance  in  the 
theory  occurred  in  the  late  1800s  when  the  method  of  moments 
was  formulated  by  K.  Pearson.  This  method  is  considered  one 
of  the  classical  estimation  techniques  although  it  is  not  as 
efficient  as  other  methods  used  today.  It  is  usually  em¬ 
ployed  when  other  estimators  are  difficult  to  compute 
(10:6).  The  present  day  foundation  of  estimation  theory  was 
developed  by  R.A.  Fisher  in  the  1920s.  His  method,  called 


the  method  of  maximum  likelihood  is  usually  superior  to  the 


method  of  moments.  Gauss  had  actually  anticipated  the  max¬ 
imum  likelihood  method,  but  felt  that  it  was  inferior  to  the 
method  of  least  squares  (10:6). 

A  more  recent  method  of  parameter  estimation  is  the  min¬ 
imum  distance  method.  This  method  was  developed  by  Wolfo- 
witz  in  the  1950s  under  a  contract  with  the  Department  of 
Defense.  The  method  assumes  a  probability  distribution, 
calculates  an  initial  estimate  of  the  parameter  by  another 
method,  and  then  tries  to  improve  that  estimate  by  minimiz¬ 
ing  the  distance  between  the  empirical  distribution  and  the 
assumed  or  theoretical  distribtion.  Wolfowitz  states  that 
"the  minimum  distance  method  will,  in  a  wide  variety  of 
cases,  furnish  super — consistent  estimators  even  when  the 
classical  methods,  such  as  the  maximim  likelihood  methods, 
fail  to  give  consistent  estimators'*  (29:9,30:203).  Distance 
estimation  techniques  for  various  distributions  have  also 
generally  shown  improvement  over  the  maximum  likelihood 
estimators  (4:9). 

Comparison  of  the  minimum  distance  estimator  with  oth¬ 
er  estimators  has  been  an  effort  guided  by  Dr  Albert  H. 
Moore,  a  professor  of  statistics  at  the  Air  Force  Institute 
of  Technology.  He  has  directed  several  thesis  efforts  in 
estimating  distribution  parameters  by  the  minimum  distance 
technique  and  comparing  these  estimations  with  other  esti¬ 
mators.  These  thesis  efforts  have  included  comparing  esti¬ 
mators  of  the  f our-parameter  beta  distribution,  the  exponen¬ 
tial  distribution,  the  f our-parameter  generalized  gamma  dis- 
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tribution,  the  generalized  t  distribution,  the  three  para¬ 
meter  Weibull  distribution,  and  the  three  parameter  log¬ 
normal  distribution.  An  unpublished  thesis  concurrently 
compared  minimum  distance  estimators  with  other  estimators 
of  the  three  parameter  Pareto  distribution. 

Another  distribution  in  which  minimum  distance  estima¬ 
tion  has  not  yet  been  fully  explored  is  the  Cauchy  distribu¬ 
tion.  The  Cauchy,  named  after  the  French  mathematician  Aug¬ 
ustus  Cauchy,  is  a  probability  distribution  which  has  been 
largely  ignored  in  modeling  because  the  expected  value  of 
its  parameters  are  not  well  defined  and  are  difficult  to 
estimate  accurately  by  conventional  methods.  The  distribu¬ 
tion  has  received  more  attention  lately,  however,  due  to  the 
computer’s  ability  to  quickly  compute  estimates  of  its  para¬ 
meters.  The  Cauchy  has  been  shown  useful  in  mechanics, 
electrical  theory,  and  measurement  and  calibration  problems 
(9:418),  and  has  lately  been  considered  as  an  alternative  to 
the  normal  distribution  in  describing  error  distributions 
where  the  error  terms  have  very  long  tails  (13:114,23:9). 
This  might  find  application  in  simulation  studies  (26:2). 
Also,  infinite  variance  distributions  (such  as  the  Cauchy) 
have  been  considered  in  time  series  analyses  of  economic 
data  (11:275). 

Problem  Statement 

Minimum  distance  estimation  has  not  been  applied  to 
estimators  of  the  location  or  scale  parameter  of  the  Cauchy 
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distribution.  A  controlled  experiment  needs  to  be  performed 
to  determine  if  minimum  distance  can  improve  upon  estimators 
such  as  the  maximum  likelihood  estimator  (mle). 

Research  Question 

Will  minimum  distance  estimation  improve  the  estimation 
of  the  location/scale  parameter  over  that  of  the  mle?  If 
the  estimation  of  the  location/scale  parameter  is  improved, 
will  the  new  estimate  improve  the  maximum  likelihood  esti¬ 
mate  of  the  scale/ location  parameter?  How  much  will  the 
minimum  distance  technique  improve  the  estimates?  Will 
using  censored  samples  (samples  with  certain  data  points 
removed)  make  a  difference  in  the  estimates? 

Methodology 

The  research  questions  will  be  answered  by  setting  up 
a  control led  .environment  and  performing  a  Monte  Carlo  analy¬ 
sis.  First,  Cauchy  random  variates  will  be  generated  from  a 
Cauchy  distribution.  The  Cauchy  random  variate  generator 
GGCAY  found  in  the  International  Mathematical  Statistics 
Library  (IMSL)  will  be  utilized  to  generate  the  necessary 
samples.  Sample  sizes  of  6,8,10,12,  and  16  will  be  genei — 
ated  with  a  location  parameter  of  0  and  a  scale  parameter  of 
1.  Both  censored  and  non-censor ed  samples  will  be  used.  A 
second  set  of  samples  with  sample  sizes  of  6  and  16  will  be 
generated  with  a  location  parameter  of  -2  and  a  scale  para¬ 
meter  of  5.  This  variation  in  the  location  and  scale  para¬ 
meters  will  allow  investigation  of  the  invariance  of  the 


estimators 


Once  the  necessary  samples  are  generated,  the  location 
and  scale  parameters  will  be  estimated  by  the  mle  method. 
Depending  on  the  distance  measure  used,  the  minimum  distance 
technique  will  try  to  improve  the  mle  of  the  location/scale 
parameter.  Both  estimates  will  be  compared  to  the  original 
parameter  value  to  sera  which  one  is  better.  If  the  loca¬ 
tion/scale  estimate  is  improved,  a  new  scale/ location  para¬ 
meter  will  be  estimated  using  the  mle.  This  new  estimate 
will  be  compared  to  the  original  mle  of  the  scale/ location 
parameter  to  see  if  there  is  any  improvement.  Finally,  the 
degree  of  improvement  will  be  calculated  for  both  parameters 
and  the  results  tabulated. 

Chronology 

Chapter  Two  will  describe  the  Cauchy  distribution. 
Chapter  Three  will  discuss  estimators  of  the  Cauchy  distri¬ 
bution  with  emphasis  on  the  maximum  likelihood  estimator. 
Chapter  Four  will  discuss  minimum  distance  estimation  and 
describe  the  different  distance  estimators  including  estima¬ 
tors  using  censored  samples.  Chapter  Five  will  discuss  the 
Monte  Carlo  analysis,  describe  how  to  compute  the  distance 
measures,  discuss  the  programming  involved,  and  define  the 
criteria  by  which  to  evaluate  the  methods.  Chapter  Six  will 
discuss  results,  conclusions,  and  suggest  follow-on  work. 


6 


II.  THE  CAUCHY  DISTRIBUTION 


Introduction 

The  Cauchy  is  a  special  form  of  a  general  class  of 
distributions  called  the  Pearson  Type  VII  distributions 
(16:154)  which  are  unimodal  and  symetnc  about  a  central 
location  parameter.  They  have  the  functional  form 

dF  «  (l/k^>(  1  +  (  (x-A>  /*)2  )‘m  dx  2-1 

where  m  >  1/2 

and  k  is  a  constant  (19:302). 


The  location  parameter,  A  ,  serves  both  as  the  unique  median 
and  mode.  The  distribution  reduces  to  the  Cauchy  when  m 
equals  1  and  k  equals  7 T,  and  looks  very  similar  to  the 
normal  as  shown  in  Figure  1,  although  it  is  more  peaked  and 
thicker  tailed.  Because  of  the  Cauchy’s  infinite  variance, 
there  is  no  standardized  form  of  the  distr ibution,  however, 
the  Cauchy  does  have  a  standard  form  by  setting  the  location 
parameter  to  0  and  the  scale  parameter ,  l// ,  to  1.  When  this 
is  done,  the  form  is  the  same  as  the  t-distr ibution  with  one 
degree  of  freedom. 


Probability  Density  Function 
The  probability  density 
has  the  form 

»  l/(r(  l 


function  (pdf)  of  the  Cauchy 


+  ((;<-X)/*)2>  2-2 
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Figure  1.  Normal  versus  Cauchy 


with 


(  -  oo  <  x  <  +  ao  ) 


(  -  as  <  A  <  +  oo  ) 

(  Ip  >  O  ) 

where  A  >  as  already  mentioned,  is  the  location  parameter 
and  ip  is  the  scale  parameter.  The  distribution  is  symetric 
about  the  median  A  with  the  upper  and  lower  quartiles  equal 
to  \±  p. 

One  reason  why  the  Cauchy  has  not  been  used  in  common 
statistical  applications  is  because  the  statistical  theory 
that  exists  for  the  normal  is  nonexistent  for  the  Cauchy 
(11:275).  This  is  partially  because  the  expected  value 
<E(x))  of  the  location  parameter  (also  known  as  the  first 
moment)  is  not  well  defined.  Since  E(x)  is  defined  as  the 
integral  of  xf(x)  and  as  b  approaches  +00  and  a  approaches 
-00  , 

rb 

/x<  1/(JT(  1+  (  lx-\)/p)2  )  >dx  = 

a 

(l/2r)  Clog  (1+b2) -log  (1+a2)  3  2-3 

which  is  infinite,  the  expected  value  of  location  is  not 
well  defined  (32:205)  (Note:  In  the  above  integral,  p=  1 
and  A  *»  0.  The  result  would  be  the  same  if  the  parameters 
held  different  values) .  Therefore  the  mean  value  of  a 
Cauchy  sample  cannot  represent  the  expected  value  of  the 
location  parameter  and  regardless  of  how  large  the  sample 
is,  the  variability  of  the  mean  about  a  central  value  does 
not  decrease  as  the  sample  size  increases  (9:417).  There¬ 
fore,  the  sample  mean  is  not  a  useful  estimate  of  the  loca- 
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tion  parameter. 

The  other  problem  in  using  the  Cauchy  in  statistical 
applications  lies  in  the  Cauchy’s  infinite  variance  (also 
known  as  the  second  moment).  This  is  also  a  direct  result 
of  (2-3).  This  can  create  profound  problems  because  as 
Granger  reports: 

alternative  measures  of  dispersion  such  as 
absolute  deviations  are  much  harder  to  work  with, 
and  in  a  regime  of  infinite  variance,  the  squares 
of  deviations  become  suspect  as  a  measure  of  dis¬ 
persion,  since  the  means  of  squares  diverge  in  the 
non-Gaussian  populations  (11:276). 

Other  estimates  of  the  parameters  must  therefore  be  used. 
With  the  aid  of  the  computer,  however,  many  estimates  can  be 
easi 1 y  eval uated . 

Cumulative  Distribution  Function 

Integrating  (2)  from  -00  to  +  QO  yields  the  cumulative 
distribution  function  (cdf)  which  has  the  form 

F(x)  =  1/2  +  ( l/w) arctanf (x-X) /^>  2-4 

Stable  Law  and  the  Character i sti c  Function 

The  Cauchy  distribution  follows  stable  law  which  im¬ 
plies  that  the  sum  of  two  Cauchy  variates  is  also  Cauchy 
distributed  (19:273).  Stable  distributions  are  usually  de¬ 
fined  by  their  characteristic  function  (20:918)  because  very 
few  can  be  written  explicitly  (11:275).  The  Cauchy,  how¬ 
ever,  can  be  written  explicitly  as  (2-2). 


The  Cauchy  empirical  characteristic  function  has  the  form 


oo 

Cn(t>  «  / el,z  dGn(z)  ,  -  CO  <  t  <  +  CO  2-5 

-oo 

as  described  by  Parr  (25:1207). 

Important  Statistics 

Because  the  mean  and  standard  deviation  of  the  Cauchy 
sample  are  not  useful  in  estimating  the  location  and  scale 
parameters,  other  estimates  must  be  used.  Two  easily  com¬ 
puted  estimators  are  the  median  for  the  location  parameter 
and  the  semi -i nterquarti le  range  for  the  scale. 

The  median  is  defined  as  follows.  After  obtaining  a 
Cauchy  sample  of  size  N,  the  independently  random  variates 

Xi  of  the  sample  are  ordered  such  that  X1<X2< . Xn. 

The  median  is  then  defined  as  the  .5  quantile  of  the  sample 
or 


med  i  an  3  X  5  . 

If  the  sample  size  is  odd,  the  median  will  be  the  value  of 
the  middle  variate  or  X^n+ 1 ^  •  If  the  sample  size  is  even, 
the  median  will  be  the  average  of  the  middle  two  variates  or 

X  =  (1/2)  (  Xi  +  X/n+2\  )  . 

2  V"3“/ 

The  semi -i nterquarti 1 e  range  is  simply  defined  as  the 
sum  of  the  average  of  the  .25  and  .75  quantiles  (9:216)  or 

semi -interquarti 1 e  range  =  (1/2)  (  X-5  +  X7,  ). 


In  this  thesis,  a  modified  semi-interquartile  range  is 
used.  The  initial  starting  point  is  determined  by  first 
computing  the  25%  and  75%  quantiles.  The  distance  from  the 
median  to  each  quantile  is  then  computed  and  the  smaller  of 
these  distances  is  doubled  and  used  as  the  estimate. 

Order  Statistics  and  Censored  Samples 

If  a  random  sample  of  independent  random  variables 

Xl,X2,....Xn  are  ordered  such  that  XI <  X2 < . Xn,  then 

the  ordered  variables  are  called  the  corresponding  order 
statistics  (16:157)  and  each  order  statistic  from  the  Cauchy 
distribution  has  a  probability  density  function  of 

P  ,  <x>  ■  <n!/<  (r-1)  !  <n— r)  !  )  (1/2  +  ( 1/# )  arctant  (x- a)  /♦  > )  V 

Ar 

(1/2  -  ( 1  /»)  arctan-C  (x-A)/#})  (1  /<**))<  1  +  <  (x- a) /+ 3  )  2-6 

The  expected  values  the  first  and  last  order  statistics  and 
the  variances  of  the  second  and  next-to-last  order  statis¬ 
tics  are  infinite  for  Cauchy  ordered  samples  (16:157).  The 
sample  is  said  to  be  censored  when  the  first  two  and  last 
two  variates  of  the  ordered  sample  are  not  used  in  estimat¬ 
ing  the  Cauchy  parameters.  These  variates  are  called  the 
censored  variates. 

Mathematical  Applications 

If  U  and  V  are  independent  unit  normal  variables,  then 
the  distribution  of  U/V  has  a  standard  Cauchy  distribution 
(16:160).  The  common  distribution  of  U  and  V  does  not  have 
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to  be  normal .  If  c  and  d  are  nonzero  constants  and  X  is 


Cauchy  distributed,  then  cX  +  dX  is  also  Cauchy  distributed, 


The  reciprocal  of  a  Cauchy  variable  has  a  Cauchy  distribu¬ 
tion  as  wel 1 . 

Geometrical  1 y ,  the  Cauchy  represents  the  diagram  in 
Figure  2.  If  the  angle  OAP  is  uniformly  distributed  between 
-  7T/2  to  7772,  the  distance  OP  is  Cauchy  distributed.  This 
situation  might  arise  if  a  particle  is  emitted  from  a 
source  point  A  and  travels  in  a  straight  line  to  an  impact 


point  P  on  a  line  perpendi cul ar  to  AO  and  in  the  same  plane 


III.  Parameter  Estimation  of  the  Cauchy  Distribution 

Introduction 

Parameter  estimation  was  introduced  in  Chapter  One. 

This  chapter  will  continue  the  discussion  o-f  parameter  es¬ 
timation  with  specific  reference  to  the  Cauchy  distibution. 
It  should  be  noted  that  the  term  "efficiency"  in  the  follow¬ 
ing  discussion  refers  to  the  accuracy  of  an  estimator  or  how 
close  the  estimator  actually  approaches  the  real  value  of 
the  parameter. 

Estimators  of  the  Cauchy  Distribution 

There  are  several  ways  to  estimate  the  parameters  of 
the  Cauchy  distribution.  One  of  the  more  simpler  and  reli¬ 
able  estimates  of  location  is  the  median  (1:240)  which  is 
often  used  along  with  the  semi-interquarti le  range  as  init¬ 
ial  estimates  for  iterative  methods.  Two  other  more  effi¬ 
cient  methods  (2:1205-1206)  include  the  use  of  order  statis¬ 
tics  and  the  maximum  likelihood  estimator  (mle).  The  method 
of  moments,  which  can  be  used  with  many  other  distributions, 
does  not  apply  to  the  Cauchy  because  as  shown  in  Chapter 
Two,  the  moments  of  the  Cauchy  do  not  exist. 

The  literature  search  revealed  several  methods  which 
rely  on  the  use  of  order  statistics.  One  the  most  efficient 
order  statistic  methods  is  the  Best  Linear  Unbiased  Estima¬ 
tor  (BLUE)  proposed  by  Barnett  in  1966.  The  BLUE  involves 
extensive  calculations  of  the  variances  and  covariances  of 
the  order  statistics.  These  values  have  been  calculated  and 
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tabulated  -for  samples  o-f  size  n=5  ( 1 )  16  (2)  20  and  achieve  full 
asymptotic  efficiency  and  small  sample  efficiencies  of  at 
least  80'/.  when  compared  to  the  mle  (2:1206).  Barnett  says 
that  the  evaluation  of  the  estimate  with  the  aid  of  tabulated 
coefficients  is  relatively  simple  when  compared  to  the  mle. 
Other  order  statistics  methods,  although  easier  to  compute, 
are  less  efficient  than  the  BLUE  (21:211-212). 

Since  the  mle  does  not  exist  in  closed  form,  it  must  be 
computed  numerically  (13:114).  Barnett  reported  a  potential 
problem  of  multiple  solutions  in  solving  the  likelihood 
equations  but  Haas,  et  al ,  reported  that  while  actually 
sampling  from  the  Cauchy  distribution,  multiple  solutions 
were  never  found  (12:404).  They  conjectured  that  the  solu¬ 
tion  of  the  likelihood  equations  will  always  be  unique  for 
distinct  sample  sizes  of  three  or  more  and  Copas  later  showed 
this  to  be  true  (8:701).  Thus  the  difficulties  in  comput¬ 
ing  the  mle  as  forseen  by  Barnett  are  not  as  formidable  as 
previously  thought. 

A  more  recent  method  of  estimating  the  Cauchy  para¬ 
meters  is  through  the  use  of  window  estimates  proposed  by 
Higgins  and  Tichenor  (14:157).  This  is  a  closed  form  method 
and  has  the  same  asymptotic  distributions  as  the  maximum 
likelihood  estimates  (13:114).  These  estimates  have  high 
efficiencies  along  with  the  maximum  likelihood  estimates  for 
moderate  to  large  sample  sizes  (14:164).  Smaller  sample 
sizes,  however,  are  less  efficient.  For  example,  the  vari¬ 
ance  of  the  estimate  of  the  location  parameter  by  window 


estimation  is  larger  than  that  of  a  similar  estimate  by  the 
mle  for  a  sample  size  of  10  (14:164).  Higgins  does  report, 
however,  that  a  major  advantage  of  his  method  is  its  ease  of 
computation  (13:113). 

Another  recent  estimation  technique  involves  order 
statistics  and  use  of  the  empirical  characteristic  function 
(21:205).  The  efficiencies  of  these  estimates  were  found  to 
be  superior  to  the  BLUE  but  the  author  did  not  compare  his 
estimates  to  that  of  the  mle.  Observing  the  variance  of  the 
characteristic  function  estimator  for  a  sample  size  of  10 
reveals  that  this  variance  is  larger  than  that  of  the  mle. 

The  mle,  in  all  cases  considered,  proved  to  be  the  most 
efficient  estimator.  For  this  reason,  the  mle  was  chosen  to 
be  the  initial  estimate  of  the  Cauchy  parameters  before  ap¬ 
plying  minimum  distance  techniques. 

Max i mum  Li kel ihood  Estimator 

The  maximum  likelihood  function,  L (X 1 , X2, . . . Xn; X , i) ,  for 
a  sample  size  n  from  the  Cauchy  distribution  is  given  by 
Haas,  et  al ,  as 

n 

L  (XI ,  X2,  ...  Xn;  A  ,  =  II  C  l/(7T^(l  +  (  Xi  -  X  /\p ) 2  )  )  1  3-1 

i  =  1 

with  the  logarithm  of  the  likelihood  function  as 

n 

Log  L  =  -n  (log  (70)  -  n(log(\)>  -  log<l+((Xi-\)/^)^>.  3-2 

i  =  l 

Hense  the  maximum  likelihood  equation  are 
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n 

<<Xi-A>/^)/{  1  +  (  <Xi-A>  /yp)1  3-  =  0  3 

i  =1 
n 

1/C  1  +  (  (Xi-A) /4,)1  )  3  =  1  /  (2n )  3 

i  =  1 

where  A  and  \p  are  the  estimates  of  the  location  and  scale 
respectively.  The  equations  must  must  be  solved  numerically 
(12:404) . 

Haas  used  the  Newton-Rhapson  method  to  solve  the  equa¬ 
tions  but  they  can  also  be  solved  by  an  iterative  method. 
This  was  the  method  used  by  the  Princeton  Study  (.1:16)  and 
the  iterative  solutions  are  shown  in  equations  3-5  and  3-6. 
Equation  3-5  is  used  to  estimate  the  location  parameter 
while  equation  3-6  is  used  to  estimate  the  scale. 

W  £  <Xi/C*k2  +  J>/  E  *  <*i-V  3-5 

1=1  1=1 

. _  3  n 

n/c  2(V2  £  1/(^?  +  <xi-xk>2>].  3-6 

i  =  1 

Both  equations  require  initial  starting  values  to  initiate 
the  iteration.  The  median  is  used  as  the  starting  point  for 
the  location  and  is  designated  by  Xk  .  The  modified  semi- 
i nterquart i 1 e  range  is  used  as  the  starting  point  for  the 
scale  and  is  designated  by  .  This  was  the  estimate  used 
in  the  Princeton  study.  Each  succeeding  iteration  uses  the 
most  previous  iterated  values  of  location  and  scale  as  ini¬ 


tial  starting  values. 


Both  equations  3-5  and  3-6  were  incorporated  into  a 
Fortran  subroutine  called  CMLE  (Cauchy  Maximum  Likelihood 
Estimator)  to  generate  the  estimates.  The  routine  was  a- 
dapted  from  the  Fortran  function  listed  in  the  Princeton 
Study  (1:262).  A  listing  of  the  subroutine  can  be  found  in 
the  main  program  CLPEE  in  Appendix  B.  Results  of  the  esti¬ 
mates  are  discussed  in  Chapter  Five. 


IV.  Minimum  Distance  Estimation 


Introduction 

The  minimum  distance  <MD>  method  o-f  parameter  estima¬ 
tion  was  pioneered  by  Wolfowitz  during  the  1950s.  In  his 
■fundamental  paper  (1957),  he  outlined  the  MD  method,  gave  a 
number  o-f  examples  concerning  its  use,  and  proved  consistency 
o-f  the  estimators  (converging  with  a  probability  o-f  one) 
(24:616,31:75).  A  decade  later,  other  work  ■followed  includ¬ 
ing  Sahler  (1970),  who  surveyed  a  variety  of  empirical  dis¬ 
tributions,  outlined  conditions  for  the  existence  and  con¬ 
sistency  of  MD  estimators,  and  proved  an  asymptotic  normal¬ 
ity  result  (18:33).  Knusel  (1969)  examined  the  robustness 
of  the  MD  method  and  showed  that  MD  estimators  have  similar 
robust  properties  as  the  maximum  likelihood  estimators.  Ro¬ 
bustness,  here,  refers  to  the  ability  of  an  estimator  to  be 
reasonably  accurate  even  though  the  underlying  distribution 
deviates  somewhat  from  the  theoretical  distribution.  Parr 
and  Schucany  also  investigated  the  robustness  properties  of 
MD  estimators  and  used  MD  techniques  to  estimate  the  loca¬ 
tion  parameter  of  the  normal  distribution.  They  used  samples 
from  a  wide  variety  of  symetric  distributions  including  the 
Cauchy.  Other  leaders  in  the  field  of  MD  estimation  include 
Blackman,  Rao,  and  Littlell.  For  a  complete  list  of  contri¬ 
butions  on  MD  estimators,  see  Parr’s  bibliography  on  minimum 
distance  estimation  (23). 
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Definition  and  Notat i on 


Basically  the  MD  method  involves  minimising  the  dis¬ 
tance  or  discrepency  between  two  cumulative  distribution 
•functions.  One  function  is  the  cumulative  empirical  distri¬ 
bution  function  <edf )  which  is  determined  from  an  ordered 
random  sample  Xl,X2,...Xn  where  X14X2£..Xn.  The  cumulative 
edf  is  defined  as  the  function  which  equals  the  fraction  of 
Xis  that  are  less  than  or  equal  to  x  for  each  x,  -OO  <  x  < 

+  00  <7:69>.  The  function  is  a  step  function  and  can  be 

written  as 


Gn(x)  =  i /n  4-1 

where  i  is  simply  the  position  number  of  the  Xi  point.  For 
all  x  in  the  interval  Xi  <  x<  Xj+|  ,  Gn(x>  is  equal  to  i/n. 

The  second  function  is  the  assumed  cumulative  probability 
distribution  and  comes  from  a  parameterized  family  of  theo¬ 
retical  distribution  functions.  Fp C x;0eQ)  represents  the 
theoretical  funcion  where  d  is  a  particular  parameter  and  £1 
represents  the  parameter  space.  In  this  thesis,  F^  represents 
the  Cauchy  distribution  with  estimated  scale  and  location 
parameters  and  Gn  represents  the  edf  based  on  the  random 
sample.  Figure  3  compares  the  edf  for  both  the  edf  and 
theoretical  distribution. 

The  discrepency  between  the  two  is  measured.  The  par¬ 
ameter  being  estimated  is  then  altered  by  a  small  value  and 
a  new  discrepency  is  calculated.  The  new  discrepency  is 
compared  with  the  old  one  and  the  one  which  has  the  mimimum 
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value  is  retained  along  with  the  parameter  value  that  gave 
the  minimum  discrepency.  This  procedure  is  repeated  until 
the  minimum  value  over  a  wide  range  of  parameter  values  is 
obtained. 

Minimum  distance  estimation  is  closely  related  in  theory 
to  another  statistical  procedure  called  goodness-of  — f  i  t .  In 
a  goodness-o-f  — f  i  t  test,  one  tries  to  determine  i -f  a  particu¬ 
lar  set  o-f  data  belongs  to  a  particular  probability  distri¬ 
bution.  The  sample  data  is  used  to  determine  the  ed-f  and  to 
estimate  the  parameters  o-f  the  theoretical  distribution. 

The  discrepency  between  the  ed-f  and  the  theoretical  distri¬ 
bution  is  then  calculated  and  compared  to  a  value  in  a  good- 
ness-o-f— fit  table  to  determine  the  data’s  confidence  level 
in  belonging  to  the  theoretical  distribution.  MD  estimation 
extends  the  goodness-o-f -f i t  process  by  minimizing  the  dis¬ 
crepency  through  alteration  of  the  distribution’s  parameters 
and  thereby  estimating  those  parameters. 

The  descrepency  in  MD  estimation  or  a  goodness-of-f it 
test  can  be  defined  in  a  number  of  ways  because  the  methods 
do  not  depend  upon  any  particular  defintion  of  distance 
(29:204).  In  fact,  several  distance  defintions  are  discus¬ 
sed  in  this  chapter  and  employed  in  this  thesis. 

Weighted  Kolmogorov  Pi  stance 

The  weighted  Kolmogorov  distance  is  defined  as 

D0<Gn,F0>  =  supx  I  Gn  (x )  -  Fj(x)l  ♦<FJ(x)l.  4-2 
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This  distance  measure  is  well  known  from  its  use  in  the 
Kolmogorov-Smi rnov  goodness-of-f it  tests.  The  distance  is 
defined  as  the  maximum  vertical  distance  (also  known  as  sup) 
between  the  edf  evaluated  at  a  given  point  and  the  value  of 
the  theoretical  distribution  at  the  same  point.  The  weight¬ 
ing  function,  0(F^(x)),  assigns  weights  for  each  point  and 
in  this  thesis  is  chosen  to  be  equal  to  one. 

When  using  censored  samples,  the  distances  between  cen¬ 
sored  variates  and  the  edf  are  not  used  in  determining  the 
Kolmogorov  distance.  Everything  else  remains  the  same.  The 
value  of  the  edf  does  not  change  and  still  uses  the  original 
sample  size  and  variate  postion  in  determining  its  value. 

Weighted  Cramer — von  Mises  (CVM)  Distance 

The  weighted  CVM  distance  is  defined  mathematically  as 

/oo 

( Gn ( x  )  -  Ffl  Cx>  )2$  (F9  (X)  >dFtf  (x)  .  4-3 

-oo 

This  distance  represents  the  area  discrepency  between  the 
edf  and  the  theoretical  distribution.  As  with  the  Kolmogorov 
distance,  there  is  a  weighting  function,  0(Fg(x))f  which  is 
assigned  the  value  of  one. 

For  censored  samples,  the  distance  measure  definition 
supplied  by  Sweeder  (28:208)  may  be  used  which  is 

/xm  a  x 

(Gn  (x  )  -  F^  (x)  >2^(Ftf  (x)  )dFtf  (x)  .  4-4 

xmin 
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For  the  CVM  discrepancy  with  $  equal  to  one,  this  is 
de-fined  as 


2  "-3  f  *i+l 

w  -  E  /  <i 

1=3  Jx. 


i/n  -  Fa (x > )  dFfl  (x ) 


4  ! 


This  can  be  rewritten  as 


"-3  [xi+i 

El  <<i 

i*3  Jx j 


/n)  2  -  (i/n)Fg(x>  +  F^  (x  )2  )  dF^  (x ) 


or 


n-3 

E 

i  =3 


f*  i+1  2  /*rti+l 

I  (i/n)  dF^(x)  -/  (i/n)Fn (x)dF„ (x)  + 

A,  A<| 

r 


i+1  2 

F_(x)  dF.  (x) 
0  o 


4 


Integrating  yields 


n-3 


E 

i  =3 


( 


(i/n)  F  (x)  -  (i/(2n)  )Fj(x)  +  (l/3)F^(x) 


)  ir 


1  J 


4  ! 


which  is  the  Cramer — von  Mises  distance  for  censored  samples 


Anderson-Dar 1 ing  (AD)  Statistic 


This  distance  is  represented  mathematically  as 


Aj(Sn,Fd)  = 


/ 


oo 


(Gn(x)  -  Fj(x)>  ( 1/ (u ( 1-u) ) )dFj  (x)  .  4 


-oo 


This  distance  is  actually  the  Cramer-von  Mises  statistic 


with  the  weighting  function  equal  to  (l/(u(l— u) ) )  and  u 


equal  to  the  theoretical  distribution.  It  can  be  seen  by 
inspection  that  this  weighting  function  gives  more  weight  to 
the  tails  of  the  distribution. 

For  censored  samples,  this  measure  is  defined  as 


Fe  (x)  )2  <1/  <Fg  <x)  (l-Fg  <x>  )  >)dFe  (x)i 
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Rewriting  and  integrating  yields 


n-3 

L 

i~3 


( 


(i/n)“  ln<F(?  (x)  /  (1-F0  (x)  )  )  +  2  (i /n)  In  ( 1-F0  (x )  ) 


-  Fe  (x)  -  1  n  ( 1  — F^  (x)  > 


jr 


xi+i 
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The  Kuiper  Distance 

The  Kuiper  distance  is  defined  as 


V  (Gn,  Fg  )  -  suP-«,<a<b<co  *  <6n(b)  "  Gn(a))  - 

< Fe (b)  -  F$  <a) ) | .  4-12 

This  distance  is  related  to  the  Kolmogorov  distance  in  that 
it  also  uses  the  vertical  distance  between  the  edf  and  the 
theoretical  distribution.  Two  distances  are  calculated, 
however.  The  first  distance  is  from  the  edf  to  the  theoret¬ 
ical  distribution  where  the  edf  value  is  greater  than  the 
theoretical.  The  second  distance  is  from  the  theoretical 
distribution  to  the  edf  where  the  theoretical  value  is 


greater  than  the  edf 


These  distances  are  illustrated  in 


Figure  4.  Sup  values  are  recorded  -for  each,  summed,  and  the 


sum  is  then  minimized  using  MD  techniques.  The  points  a  and 
b  represent  the  points  at  which  the  sups  occur  and  de-fine 
the  maximal  probability  interval.  This  distance  tends  to 
give  more  accurate  estimates  for  scale  parameters  than  it 
does  for  location  parameters  (18:38).  Censored  samples  are 
handled  in  the  same  way  as  the  Kolmogorov  distance. 


Watson  Pi  stance 

The  Watson  distance  is  derived  from  a  general  class  of 
discrepencies  defined  as 


/ 


00 


Zf  b  (Gn,F„)  =  a  /  ((Gn(x)  -  F„  (x )  )  *  dFfl  (x ) 


b 


/ 


-oo 


oo 


(Gn  (x)  -  F$  (x) )dF#  (x) 


-00 
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When  the  parameter  a  and  b  equal  1  and  0  respectively,  the 
discrepency  reduces  to  the  Cramer — von  liises  statistic.  When 
a  equals  1  and  b  equals  -1,  the  discrepency  is  defined  as 
the  Watson  statistic  (24:616).  As  with  the  Kuiper  distance, 
the  Watson  distance  tends  to  give  better  estimates  of  the 
scale  parmeter  (24:618). 

The  censored  sample  distance  is  defined  as 
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Integrating,  this  yields 


SUP  above 


Figure  4.  Kuiper  Distance 


W  -  2-1  (<i/n>F.,<x>  -  ( 1/2)  Fg  (x ) 


[i=3  \ 


)\x,  J 
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which  is  the  Watson  distance  measure  -For  censored  samples. 


Empirical  Distribution  Function  Distance 

Another  class  of  distances  is  based  upon  the  empirical 
characteristic  function  (25:1206).  Using  equation  2-5  as 
the  empirical  characteristic  -function  and  letting 


dGn(z) ,  -  oo  <  t  <  +00  , 
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typical  distance  measures  include 


6p(Gn,Fa)  » 


J  lcn(t)  - 


C<t;0  >1  W(t)dt 
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b  (Gn,  Fa  )  =  supt  I  Cn  (t )  -  C  (t ;  6  )  I  w  (t  J 
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These  distance  measures  were  not  investigated  in  this  thesis 
but  are  recommended  as  possible  follow  on  work. 


Summary 

The  Kolmogorov,  Cramer — von  Mises,  Anderson-Darl ing, 
Kuiper,  and  Watson  distance  measures  were  used  to  estimate 
the  location  parameter  of  the  Cauchy  distribution.  The  Kuiper 
and  Watson  were  included  because  the  Parr  study  showed  these 
estimators  to  perform  better  with  samples  from  long  tailed 


^  M    A  .  A  u  *  .  m  * 


.  /-.  VVv  v 

*>  V'  «  '  » 


distributions  (24:620).  In  doing  the  location  estimation, 
the  value  used  for  the  scale  parameter  was  the  mle  estimate 
of  the  scale  and  that  value  was  retained  throughout  the 
minimum  distance  estimation.  The  same  process  was  repeated 
in  estimating  the  scale  parameter.  The  Kuiper  and  Watson 
measures  were  used  to  estimate  the  scale  parameter  and  the  mle 
estimate  of  the  location  was  used  and  retained  throughout  the 
MD  process.  Results  of  the  minimum  distance  estimations  are 
discussed  in  Chapter  Six. 
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V.  Monte  Carlo  Analysi s 


Introduction 

Monte  Carlo  analysis  is  the  process  which  is  used  to 
evaluate  the  efficiency  of  the  estimators.  The  process 
involves  generation  of  the  Cauchy  deviates,  computation  of 
the  parameter  estimates,  comparison  of  the  estimates  with 
the  real  parameter  values,  and  finally  comparison  of  the 
estimators  with  each  other.  All  of  the  Monte  Carlo  analysis 
in  this  thesis  was  done  with  the  Vax  11/780  computer  which 
is  owned  and  operated  by  AFIT. 

Generation  of  Data 

The  generation  of  Cauchy  deviates  was  facilitated  by 
the  use  of  the  IMSL  subroutine  6GCAY.  This  routine  was  also 
used  by  Koutrouvelis  in  his  estimations  using  the  empirical 
characteristic  function.  GGCAY  uses  a  ratio  method  to  gen¬ 
erate  the  deviates  as  opposed  to  the  inversion  method 
because  of  the  ratio’s  better  efficiency  ( 15: GGCAY-1 ) . 
Standard  deviates  were  generated  which  had  the  scale  value 
of  one  and  the  location  value  of  zero.  Deviates  from  Cauchy 
distributions  other  than  the  standard  were  generated  by  mul¬ 
tiplying  the  standard  deviate  by  the  scale  value  and  adding 
to  it  the  location  value.  Sample  sizes  of  6,8,10,12,  and  16 
were  generated  with  location  and  scale  values  of  zero  and 
one  respectively.  Sample  sizes  of  6  and  16  were  generated 
with  a  location  value  of  -2  and  a  scale  value  of  5  to  test 
for  invariance.  The  number  of  samples  generated  for  each 
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sample  size  and  location /scale  set  was  lOOO.  Caso’s  study 
recommended  2000  samples  but  stated  that  1000  was  sufficient 
(5:37).  The  sample  size  of  1000  was  used  to  reduce  the  load 
on  computer  resources. 

Initial  Estimates 

The  initial  estimates  were  the  median  for  the  location 
and  the  modified  semi-interquarti le  range  for  the  scale. 

Both  estimates  were  computed  in  subroutine  MEDSEM  which  was 
adapted  from  the  subroutine  used  to  generate  the  maximum 
likelihood  estimators  in  the  Princeton  Study'.  A  prerequi¬ 
site  to  running  the  routine  was  that  the  sample  needed  to  be 
ordered. 

Max i mum  Li kel ihood  Estimates 

The  maximum  likelihood  estimators  were  generated  using 
a  subroutine  called  CMLE  which  was  also  adapted  from  the 
Princeton  study  (1:262).  This  routine  required  the  same 
Cauchy  ordered  sample  and  the  median  and  modified  semi-int- 
erquartile  range  generated  from  MEDSEM.  The  routine  then 
used  an  iterative  procedure  to  generate  the  mles.  One  draw¬ 
back  to  numerical  estimates  of  the  mles  was  the  possibility 
of  divergence  or  non-convergence.  This  routine  as  initially 
written  provided  for  only  a  maximum  of  twenty  iterations  for 
an  estimate  to  converge,  and  hence  produced  for  a  number  of 
non-converging  estimates,  especially  in  the  smaller  sample 
sizes.  Results  of  non-converging  estimates  are  shown  in 


Table  I  in  Appendix  A.  All  of  the  scale  estimates  converged 


and  none  of  the  non-converging  location  estimates  diverged. 
As  shown  in  Table  I,  an  iteration  maximum  of  100  produced 
only  one  non-converging  estimate  in  a  sample  size  of  six  and 
no  non-converging  estimates  for  larger  sample  sizes.  The 
difference  between  the  final  nonconverging  mle  estimates 
was  less  than  .004.  Because  of  these  results,  the  iteration 
maximum  of  100  was  used  in  CMLE. 

Basically,  the  iteration  scheme  proceeded  as  follows. 
First,  the  median  and  semi -interquarti le  range  was’  put  into 
Equations  3-5  to  estimate  the  location  and  3-6  to  estimate 
the  scale.  The  differences  between  the  new  estimates  arid 
the  old  estimates  were  computed.  If  the  absolute  difference 
between  the  new  estimate  and  old  estimate  of  location  was 
less  than  .001  times  the  scale  estimate,  the  estimate  con¬ 
verged.  If  the  absolute  difference  between  the  new  estimate 
and  the  old  estimate  of  scale  was  less  than  .05  times  the 
scale  estimate,  the  scale  estimate  converged.  These  were 
the  criteria  for  convergence  used  in  the  Princeton  Study. 

If  both  estimates  converged,  the  new  estimates  were  returned 
as  the  mle  estimates  of  location  and  scale.  If  one  or  both 
estimates  did  not  converge,  the  iteration  process  repeated 
until  convergence  occured  or  until  the  iteration  maximum  was 
reached.  If  the  iteration  maximum  was  reached,  the  final 
estimates  were  used  as  the  mle  estimates. 

Minimum  Distance  Estimators 

The  minimum  distance  estimators  and  ways  to  estimate 


them  were  discussed  in  Chapter  Four.  Only  one  parameter  was 
estimated  at  a  time.  The  other  parameter  was  held  -fixed  at 
its  mle  estimate  while  the  other  one  was  estimated  by  MD 
methods.  Let  \  represent  the  mle  estimate  of  location  and  \p' 
represent  the  mle  estimate  of  scale.  To  find  which  loca¬ 
tion  estimate  gave  the  minimum  distance,  location  estimates 
in  the  range  from  X±^were  used  to  calculate  the  distance. 

t 

Starting  with  a  location  estimate  equal  to  X  ,  the  estimate 

/ 

was  incremented  by  .Ol^until  it  reached  a  value  value  of 

f  t  i  t 

X  +y.  If  the  minimum  distance  occured  at  X+w,  then  the  esti- 

/  / 

mate  was  extended  to  X+2^,  and  so  forth.  The  estimate  that 

yielded  the  minimum  distance  was  recorded.  The  process  was 

/  / 

repeated  again  starting  with  X  and  .01^  being  subtracted 
from  each  previous  estimate  till  the  estimate  reached  a  value 
of  As  before,  if  the  minimum  distance  occurred  at  X 

/  t 

the  process  was  repeated  to  X-2^,  etc.  Finally,  the  estimate 
that  yielded  the  minimum  distance  was  used  as  the  estimate 

for  the  sample.  The  scale  parameter  was  estimated  in  the  same 

/  , 

same  way  except  that  the  estimator  range  was  from  to  to 

prevent  the  scale  estimate  from  going  below  zero. 

The  distance  measures  for  non-censored  samples  was 
computed  using  the  formulas  provided  by  Stephens  (27:731). 

The  only  requirement  for  use  of  these  formulas  was  that  F  (;< ) 
be  continuous  and  completely  specified;  hense  these  formulas 
can  be  used  with  the  Cauchy  distribution.  Formulas  for  cen¬ 
sored  samples  were  derived  in  Chapter  Four. 
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Kolmogorov  Pi  stance  Estimator 


The  Kol mogorov 


minimum  distance  estimator  for  non-censored  samples  is 
defined  by  Stephens  as  (27:731): 

D  +  =  max  ( 1  <  i  <  n)  C  (i/n)  -  z;  3 
D  ~  =  max(l<i<n)  C  z-,  -  (i/n)  3 
D  =*  max<D*,D  )  5-1 

where  Zj  ,  in  this  thesis,  is  a  variate  from  the  Cauchy  cdf. 
The  minimum  distance  is  calculated  in  the  subroutine  KSMIN. 
KSMIN  first  calculates  sup  using  the  mle  estimates  and  then 
calls  on  the  subroutine  VEDIS.  VEDIS  calculates  both  D+  and 
D  .  When  these  values  are  returned  to  KSMIN,  D  is  determined 
This  distance  is  stored  and  the  procedure  described  in  the 
previous  section  is  initiated  and  proceeds  until  the  esti¬ 
mate  with  the  minimum  distance  is  determined.  Computation 
for  censored  samples  was  discussed  in  Chapter  Four.  The  only 
difference  is  that  i  is  restricted  to  be  between  3  and  n-2. 
This  estimator  was  used  as  a  location  estimator  only. 

Cramer-von  Mises  Estimator .  This  distance  estimator 
was  also  used  only  as  a  location  estimator.  The  computa¬ 
tional  formula  for  non-censored  samples  is  defined  by  Steph¬ 
ens  as  (27:731) : 

n 

W2  -  -  (2i-l)/2n3  +  (l/12n).  5-2 

i-1 

The  computation  of  the  minimum  distance  is  done  in  the  sub¬ 
routine  AEMIN.  AEMIN  is  a  general  subroutine  that  computes 
the  minimum  distance  for  the  CVM,  the  AD,  and  the  Watson 


(location)  estimators.  AEMIN  calls  upon  AEDIS  to  calculate 
the  distance.  AEDIS  calculates  the  distance  in  accordance 
with  the  value  of  IFLAG  where  a  value  of  0  calculates  the 
CVM  distance,  a  value  of  1  calculates  the  AD  distance,  and  a 
value  of  2  calculates  the  Matson  distance.  The  user  sup¬ 
plies  the  value  o-f  IFLAG  in  the  main  program.  The  minimum 
distance  calculated  in  AEMIN  is  also  computed  as  described 
in  the  MD  estimator  section.  For  censored  samples,  AEDIS  is 
modi-fied  to  use  equation  4-8. 

Ander son-Dar ling  Estimator .  The  computational  formula 
for  the  AD  estimator  is  defined  as  Stephens  as  (27:731): 

A2  =  -  ^  53  <2i-l>  Clnzj  +  ln(l-zn+1_j  ) l^/n-n  5-3 

The  minimum  distance  using  this  distance  is  also  done  by 
AEMIN  as  described  in  the  previous  section.  This  distance 
measure  is  used  for  location  estimates  only.  The  computa¬ 
tional  formula  for  censored  samples  is  given  by  4-11.  AEDIS 
is  modified  to  compute  the  distance  for  censored  samples  as 
wel  1  . 

Kuiper  Estimator.  Stephens  presents  the  computational 
formula  for  the  Kuiper  distance  as  (27:731): 

V  =  D+  +  D"  5-4 

The  subroutine  that  computes  the  minimum  distance  is  KUMIN. 
KUMIN  calls  on  VEDIS  to  calculate  D+  and  D~  .  KUMIN  then 
adds  these  two  distances  and  finds  the  minimum  distance  as 
described  earlier.  Both  the  location  and  scale  parameter 
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are  estimated  using  this  distance.  For  censored  samples, 
the  procedure  is  similar  to  -finding  the  Kolmogorov  distance 
-for  censored  samples. 

Matson  Estimator.  The  computational  formula  for 
the  Watson  distance  using  non-censored  samples  is  defined  by 
Stephens  as  (27:731): 


U2  *  W2-  n  (S'-. 5) 


where 


z  —  ^  z./n  5—6 

i=l 

For  estimating  the  location  parameter,  AEMIN  calculates  the 
minimum  distance  and  calls  upon  AEDIS  to  calculate  the  dis¬ 
tance  for  a  particular  location  estimate.  To  estimate  the 
scale  parameter,  the  subroutine  MAMIN  calculates  the  minimum 
distance  and  calls  upon  AEDIS  to  calculate  the  distance  for 
a  particular  estimate  of  scale.  For  censored  samples,  equa¬ 
tion  4-15  is  used  to  estimate  the  location  parameter.  This 
equation  did  not  give  good  estimates  for  the  scale  parameter 
when  using  censored  samples. 


Evaluation  Cr i ter i a 

The  efficiency  or  accuracy  of  an  estimator  can  be 
measured  in  several  ways.  A  common  method  is  to  calculate 
how  close  the  estimate  actually  comes  to  the  parameter  being 
estimated.  This  difference  is  squared,  summed  over  the  num¬ 
ber  of  samples,  and  divided  by  the  number  of  samples.  This 
measure  is  defined  as  the  mean  squared  error  and  is  repre- 
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sen ted  mathematically  as 

n 

MSE  =  £  (  d  -  e\  )2  /n  5-7 

i=I 

where  6  is  the  parameter  and  s'  is  the  parameter  estimate. 

The  MSE  -for  each  estimate  including  the  mle  estimates,  the 
median,  the  modified  semi-interquartile  range,  and  the  min¬ 
imum  distance  estimates  are  recorded.  One  problem  with  the 
MSE  is  that  it  is  not  scale  invariant  (18:58).  Hense,  if 
one  wants  to  test  the  invariance  of  the  estimator,  another 
measure  should  be  used. 

Another  measure  called  the  relative  efficiency  (RE) 
was  found  to  be  used  extensively  throughout  the  literature. 

This  measure  compares  the  MSE  of  one  estimator  to  the  MSE  of 
another  and  is  defined  mathematically  as 

RE  =  MSEk/MSEj  5-8 

In  this  thesis  MSEj  will  represent  the  MSE  of  the  mle  estima¬ 
tors  and  MSEk  will  represent  the  MSE  of  the  non-mle  estimators. 
Thus  an  RE  of  less  than  one  indicates  an  improvement  over  the 
mle  and  an  RE  of  greater  than  one  indicates  no  improvement  over 
the  mle.  Since  the  RE  is  scale  invariant,  it  will  allow  for 
investigation  of  the  invariance  of  the  MD  estimators. 

A  Wal k-Through  the  Computer  Program 

This  section  will  briefly  discuss  how  the  computer  pro¬ 
gram  derived  and  evaluated  the  estimates.  The  main  program 
that  derived  the  location  estimates  is  called  Cauchy  Loca- 
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tion  Parameter  Estimation  and  Efficiencies  (CLPEE)  and  can 
be  found  in  Appendix  B.  The  program  that  derived  the  scale 
estimates  is  very  similar  to  CPLEE  and  is  called  Cauchy 
Scale  Parameter  Estimation  and  Efficiencies  (CSPEE).  It  can 
be  found  in  Appendix  C.  The  first  step  in  either  case  was 
to  generate  the  empirical  distribution  function  since  this 
function  had  to  be  computed  only  once.  Next,  a  Cauchy  random 
sample  was  generated  using  GGCAY  and  ordered  from  lowest  to 
highest  by  a  routine  the  main  program.  MEDSEM  was  then  cal¬ 
led  upon  to  calculate  the  median  and  modified  semi -i terquar — 
tile  range.  The  differences  between  these  values  and  the 
real  parameters  were  calculated,  squared,  and  added  on  to 
each  previous  squared  difference.  This  process  occurred 
after  deriving  each  type  of  estimate.  Next,  CMLE  was  called 
to  derive  the  mle  estimates  using  the  median  and  modified 
semi-interquarti le  range  as  initial  estimates.  After  pro¬ 
cessing  the  MSEs,  the  subroutines  KSMIN,  AEMIN,  and  KUMIN 
were  called  to  estimate  the  location  parameter  or  KUMIN  and 
WAMIN  were  called  to  estimate  the  scale  parameter.  After 
the  final  sample  was  processed,  the  REs  were  calculated  and 
recorded.  The  subroutines  VEDIS  and  AEDIS  used  to  calculate 
the  distances  for  non-censored  samples  can  be  found  in  Ap¬ 
pendix  B  and  C.  The  modified  subroutines  of  VEDIS  and  AEDIS 
used  to  calculate  the  distances  for  censored  samples  can  be 
found  in  Appendix  D.  A  flow  chart  for  CLPEE  is  shown  in 
Figure  5.  The  flow  of  CSPEE  is  similar. 
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CONCLUSIONS  AND  RECOMMENDATIONS 


Resul ts 

Tables  II-l  through  1 1-5  in  Appendix  A  show  the  MSE  and 
RE  for  the  estimators  of  the  (0,1)  parameter  set  -for  non¬ 
censor  ed  samples.  The  tables  show  that  the  Kolmogorov,  Cra¬ 
mer  von-Mises,  and  Anderson  Darling  distance  measures  are 
in-ferior  to  the  mle  and  inferior  to  the  median  as  well.  The 
Kuiper  location  measure  was  sometimes  superior  to  the  median 
and  sometimes  not.  It  was  never  as  good  as  the  mle.  The 
Watson  statistic  was  the  one  that  appeared  to  be  on  the  same 
level  as  the  mle.  For  sample  sizes  8  and  16  it  actually 
outper-f armed  the  mle  and  in  the  other  sample  sizes  it  was 
very  close.  For  the  scale  estimators,  the  results  were 
similar.  The  Kuiper  scale  measure  always  outperformed  the 
modified  semi -interquart i le  range  but  was  never  as  good  as 
the  mle.  The  Watson  scale  estimator  was  slightly  better 
than  the  mle  for  all  sample  sizes  except  6. 

Because  of  closeness  of  the  Watson  location  and  scale 
estimators  to  the  mle  estimates,  another  run  was  made  with 
5000  samples  and  a  new  value  for  DSEED.  The  parameter  val¬ 
ues  remained  the  same.  The  results  are  shown  in  Table  III 
and  in  this  case,  the  mle  outperformed  the  Watson  estima¬ 
tors,  but  only  slightly.  In  all  sample  sizes,  the  Watson 
estimator  was  within  2V.  of  the  mle  except  for  the  location 
estimate  with  sample  size  6. 

Because  the  Kolmogorov,  CVM,  and  AD  estimators  did  not 


perform  as  well  as  expected,  it  was  thought  that  the  reason 
might  be  because  of  the  Cauchy’s  in-finite  mean  of  the  first 
and  last  order  statistic  and  the  infinite  variance  of  the 
second  and  next  to  last  order  statistic.  For  this  reason, 
the  same  runs  were  made  with  the  samples  being  censored  as 
outlined  in  Chapter  Four.  The  results  are  shown  in  Tables 
I 1-6  through  11-10  in  Appendix  A.  The  location  estimates  for 
the  three  estimators  did  improve  in  the  censored  samples,  but 
they  were  still  not  as  good  as  the  mle  or  the  Kuiper  and  Watson 
estimators  for  non-censored  samples.  In  most  cases  they  were 
not  as  good  as  the  median  (In  one  case  only,  the  Komogorov  was 
slightly  better  than  the  median).  The  unusually  high  RE  for 
the  CVM  distance  in  sample  six  was  caused  by  one  very  high  es¬ 
timate.  Similar  REs  were  observed  with  different  seeds  and  in 
sample  sizes  of  seven  as  well.  The  problem  cleared  up  when  a 
sample  size  of  at  least  eight  was  used.  The  Kuiper  estimator 
became  worse  when  censored  samples  were  used  and  approached 
values  very  close  to  the  median.  The  Watson  estimator,  al¬ 
though  still  good,  was  worse  than  its  estimates  for  non-cen- 
sored  samples.  For  the  scale  estimates,  the  results  using 
censored  samples  were  worse.  The  Kuiper  estimates  were  never 
as  good  as  the  modified  semi -interquarti 1 e  range  and  the  Wat¬ 
son  estimators  failed  to  run  at  all.  Since  every  Watson  esti¬ 
mate  was  consistently  bad,  the  MSEs  and  REs  were  not  recorded. 

Finally,  the  invariance  of  the  estimators  was  tested. 
With  the  same  seed  values  and  using  non-censored  samples, 
sample  sizes  of  6  and  16  were  run  with  a  parameter  set  of 
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(-2,5).  Table  1 1  —  1 1  shows  the  results  for  a  sample  size  of 
six  and  Table  11-12  shows  the  results  for  a  sample  size  of 
sixteen.  A  comparison  of  these  tables  with  Tables  1 1  —  1  and 
1 1-5  reveal  that  the  RE  for  each  estimator  did  not  change 
significantly  with  the  change  in  parameter  values.  In  fact, 
the  largest  change  was  less  than  .1  per  cent.  Any  change  at 
all  was  due  to  round  off  error.  Thus  the  minimum  distance 
estimators  for  the  Cauchy  distribution  are  invariant  to 
different  paramteter  values  and  thus  Parr's  assertion  is 
confirmed  (24*617). 

Conclusions 

The  censored  samples  outperformed  the  non-censored  sam¬ 
ples  in  the  case  of  the  Kolmogorov,  Cramer  von-Mises,  and 
Anderson-Darl ing  estimators  of  location.  However,  the  im¬ 
provement  was  not  enough  to  justify  using  them  because  their 
estimates  were  still  not  as  good  as  the  mle  or  the  median. 

On  the  other  hand,  the  non-censored  samples  outperformed  the 
censored  samples  in  the  case  of  the  Kuiper  ant'  Watson  statis¬ 
tics.  This  was  true  for  both  the  location  and  scale  esti¬ 
mates.  Clearly,  the  Watson  statistic,  using  non-censored 
samples,  was  the  best  of  all  in  either  case  as  a  location  or 
scale  estimator  and  was  always  almost  as  good  as  the  mle. 
However,  there  was  still  no  real  improvement  over  the  mle  and 
hense  no  attempt  was  made  to  improve  the  opposite  parameter, 
as  described  in  the  methodology  section  of  Chapter  One. 

Since  no  real  improvement  over  the  mle  estimate  has 
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been  found,  what  has  been  gained  from  this  research?  For 
one  thing,  this  study  did  reveal  that  the  Watson  statistic 
is  an  excellent  distance  measure  for  the  Cauchy  distribu¬ 
tion.  Hense  this  distance  measure  might  be  useful  in  a 
goodness-of-f i t  test  as  described  in  Chapter  Four.  The 
goodness-of -f i t  test  would  be  useful  to  determine  if  a  par¬ 
ticular  data  set  is  Cauchy  distributed  and  the  Watson  dis¬ 
tance  measure  might  provide  a  more  powerful  test  than  now 
exists  for  the  Cauchy.  The  Kuiper  statistic  might  provide 
a  powerful  test  as  well. 

Fol low-on  Work 

The  following  topics  are  recommended  as  potential  fol¬ 
low-on  work: 

1)  Because  the  Kuiper  and  Watson  estimators  were  very 
good  estimators  for  the  Cauchy  distribution,  they  have  a  lot 
of  potential  in  being  used  as  distance  measures  in  goodness- 
of-fit  tests  as  previously  described.  One  could  use  mle 
methods  to  estimate  the  parameters  of  the  hypothesized 
Cauchy  distribution  and  then  use  Monte  Carlo  analysis,  with 
the  Watson  and  Kupier  statistics  as  distance  measures,  to 
derive  the  necessary  goodness-of -f i t  tables.  The  power  of 
the  goodness-of-f it  tests  using  these  distance  measures 
could  be  compared  to  other  goodness-of-f it  tests  using  the 
Kolmogorov,  CVM,  and  AD  distance  measures. 

2)  There  is  an  IMSL  subroutine  called  ZXMIN  which  will 
minimize  a  function  of  several  variables.  Parr  used  this 


routine  to  determine  the  minimum  distance  estimators  in  his 
minimum  distance  study  (24:621).  One  could  use  this  routine 
to  estimate  both  parameters  simultaneously  using  the  median 
and  semi-interquarti le  range  as  initial  estimates.  A  com¬ 
parison  between  the  mle  and  Watson  estimators  could  then  be 
made  to  measure  the  efficiency  in  terms  of  the  computer  time 
required  to  calculate  the  estimates.  The  accuaracy  of  the 
estimators  could  be  double-checked  as  well.  Also,  the  ro¬ 
bustness  properties  of  the  MD  estimators  could  be  explored 
and  compared  with  the  robustness  properties  of  the  mle. 

This  would  be  done  by  generating  samples  from  different 
distributions,  such  as  the  normal,  and  using  these  samples  to 
estimate  the  parameters.  Even  if  the  Watson  estimators  are 
not  as  accurate  as  the  mle,  perhaps  the  speed  at  which  they 
can  be  calculated  and  their  robustness  properties  might 
justify  their  use. 

3)  Finally,  there  is  another  distance  estimator  derived 
from  the  characteri stic  function,  which  was  described  in 
Chapter  Four.  This  estimator  could  also  be  used  to  estimate 
the  Cauchy  parameters  and  its  efficiency  compared  to  other 
estimators.  An  estimator  to  compare  with  the  characteristic 
function  distance  measure  might  be  the  one  derived  by  Kou- 
trouvelis  (see  Chapter  Three)  since  he  used  the  character — 
istic  function  to  derive  his  estimator.  This  topic  might  be 
also  expanded  to  other  distributions  that  follow  stable  law 
and  that  do  not  have  closed  form  expressions  such  as  the 
normal  and  Cauchy. 
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TABLE 

1 1  —  1 

MSE  and  RE 

of  Estimators 

Sample  Size:  6 
Location  Parameter: 
Number  of  Samples: 

DSEED :  4385673 

0  Scale  Parameter 

1 000 

:  1 

Non-censored  Samples 

Location  Estimator 

MSE 

RE 

Med i an 

.3157362 

1.275193 

Maximum  Likelihood 

. 6396965 

- 

Kolmogorov 

1.0176807 

1 . 590879 

Cramer  von-Mises 

. 9793309 

1.530931 

Anderson-Dar 1 i ng 

. 9884748 

1 . 545225 

Kui per 

. 6723628 

1.051065 

Watson 

. 6968500 

1 . 089345 

Scale  Estimator 

MSE 

RE 

Mod  Semi -Interquarti 1 e  Range 

. 8787380 

1 . 577240 

Maximum  Likelihood 

. 5571 364 

- 

Kui per 

. 5901 666 

1 . 059286 

Watson 

. 5886748 

1 . 056608 

TABLE  1 1-2 

MSE  and  RE  o-f  Estimators 

Sample  Size:  8  DSEED:  4385673 

Location  Parameter:  0  Scale  Parameter: 

Number  of  Samples:  1000 


Nan-censored  Samples 


Location  Estimator 

Medi an 

Maximum  Likelihood 
Kol mogorov 
Cramer  von-Mises 
Anderson-Dar ling 
Kui per 
Watson 

Scale  Estimator 

Mod  Semi -Interquarti 1 e  Range 
Maximum  Likelihood 
Kui per 
Watson 


MSE 


. 5876736 
. 5347122 
. 7218777 
. 6334487 
. 6576732 
. 5523942 
. 4949655 

MSE 

. 6879840 
. 4572869 
. 4698043 
. 4529167 


1 

1 

1 

1 

1 


1 


1 


RE 

.  099047 

.  35003 1 
. 184654 
. 229957 
. 033068 
. 925667 

RE 

.504491 

. 027373 
. 990443 


TABLE  1 1-3 


MSE  and  RE  of  Estimators 


Sample  Size:  10 
Location  Parameter:  0 
Number  of  Samples:  1000 


DSEED:  4385673 
Scale  Parameter 


Non-censored  Samples 


Location  Estimator 

MSE 

RE 

Medi an 

.  3488217 

1 . 132660 

Maximum  Likelihood 

.  3079669 

- 

Kol mogorov 

.51 14189 

1 . 660629 

Cramer  von-Mises 

. 3902656 

1.267232 

Anderson-Dar 1 i ng 

. 4125222 

1 . 339502 

Kui per 

. 3627019 

1 . 177730 

Watson 

.3081932 

1 . 000735 

Scale  Estimator 

MSE 

RE 

Mod  Semi - Interquart i 1 e  Range 

. 4591780 

1 . 605180 

Maximum  Likelihood 

.  2860602 

- 

Kui per 

. 3225700 

1 . 127630 

Watson 

.  2656335 

. 928593 

TABLE  1 1-4 


MSE  and  RE  of  Estimators 


Sample  Size:  12 
Location  Parameter: 
Number  of  Samples: 

Non-censored  Samples 

DSEED:  4385673 

0  Scale  Parameter 

1000 

:  1 

Location  Estimator 

MSE 

RE 

Medi an 

. 2942365 

1 . 223874 

Maximum  Likelihood 

. 2404139 

- 

Kol mogorov 

.3974717 

1.653281 

Cramer  von-Mises 

.3191490 

1 . 327498 

Anderson-Dar 1 i ng 

. 3374594 

1 . 403660 

Kui per 

. 2810406 

1 . 168986 

Watson 

.2441652 

1 .015603 

Scale  Estimator 

MSE 

RE 

Mod  Semi -Interquarti 1 e 

Range 

. 2924091 

1 . 328926 

Maximum  Likelihood 

. 2200342 

- 

Kui per 

.  2*185952 

1 . 129802 

Watson 


2132684 


991975 


TABLE  1 1 -5 


MSE  and  RE 

Sample  Size:  IS 
Location  Parameter: 
Number  o-f  Samples: 

Non-censored  Samples 

o-f  Estimators 

DSEED:  4385673 

0  Scale  Parameter 

1 000 

:  1 

Location  Estimator 

MSE 

RE 

Median 

. 1928420 

1 . 152691 

Maximum  Likelihood 

. 1672972 

- 

Kol moaorov 

. 2593825 

1 . 550429 

Cramer  von-Mises 

.2113738 

1 . 263462 

Anderson-Dar 1 i nq 

. 2240063 

1 . 338972 

Kui per 

. 205651 1 

1 . 229256 

Watson 

. 1665640 

.995617 

Scale  Estimator 

MSE 

RE 

Mod  Semi -Interquarti 1 e 

Range 

.2096114 

1 . 330707 

Maximum  Likelihood 

. 1575189 

- 

Kui per 

. 1663357 

1.055973 

Watson 

. 1563607 

. 992648 

TABLE  1 1-6 


MSE  and  RE  of  Estimators 

Sample  Size:  6  DSEED:  4385673 

Location  Parameter:  0  Scale  Parameter: 

Number  of  Samples:  1000 


Censored  Samples 

Location  Estimator 

Median 

Maximum  Likelihood 

Kolmogorov 

Cramer  von-Mises 

Ander son-Dar ling 

Kuiper 

Watson 


Scale  Estimator 


MSE 

.8157362  1 

. 63969^  5 

.8151365  1 

32.71764  51 

.8151777  1 

. 8085236  1 

. 7480507  1 


MSE 


Mod  Semi -Interquarti 1 e  Range 

Maximum  Likelihood 

Kuiper 


. 8787373 


.  5571364 


.  2653000 


1 


4 


RE 

275193 

274255 

14557 

274320 

263917 

169384 

RE 

577240 

761850 


Watson 


TABLE  1 1-7 


MSE  and  RE  o-f  Estimators 


Samole  Size:  8 
Location  Parameter:  0 
Number  o-f  Samples:  1000 


DSEED:  4385673 
Scale  Parameter 


Censored  Samples 


Location  Estimator 

MSE 

RE 

Median 

. 5876736 

1 . 099047 

Maximum  Likelihood 

.5347122 

- 

Kolmogorov 

.6621354 

1 . 238303 

Cramer  von-Mises 

. 5948694 

1. 112504 

Anderson-Darl i ng 

. 6033888 

1 . 256496 

Kui per 

.6713636 

1 . 256496 

Watson 

.5576661 

. 042928 

Scale  Estimator 

MSE 

RE 

Mod  Semi -Interquart i le  Range 

. 6879840 

1.504491 

Maximum  Likelihood 

. 4572869 

- 

Kui per 

1.270862 

2.779135 

Watson 

_ 

— 

TABLE  1 1-8 


MSE  and  RE  o-f  Estimators 


Samole  Size:  10 
Location  Parameter: 
Number  o-f  Samoles: 

Censored  Samples 

DSEED : 
0  Scale 

1 000 

4385673 
Parameter:  1 

Location  Estimator 

MSE 

RE 

Medi an 

. 3488217 

1 . 132660 

Maximum  Likelihood 

. 3079669 

- 

Kol moaorav 

. 4796595 

1 . 557503 

Cramer  von-Mises 

. 3603502 

1 . 1 70094 

Anderson-Dar 1 i ng 

. 3715640 

1 . 206506 

Ku i o  er 

. 3900580 

1 . 266448 

Watson 

. 3348452 

1 . 087276 

Scale  Estimator 

MSE 

RE 

Mod  Semi -Interquarti le  Range 

. 4591780 

1 .605180 

Maximum  Likelihood 

. 2860602 

- 

Kuiper 

. 5427296 

1 .897257 

Watson 

_ 

_ 

56 


TABLE  1 1-9 

MSE  and  RE  of  Estimators 


Samole  Sine:  12 
Location  Parameter:  0 
Number  of  Samoles:  1000 


DSEED :  4385673 
Scale  Parameter 


Censored  Samples 


Location  Estimator 

MSE 

RE 

Medi an 

. 2942365 

1 . 223874 

Maximum  Likelihood 

.2404139 

- 

Kol moaorov 

. 3703235 

1 . 540358 

Cramer  von-Mises 

. 2957745 

1 . 230272 

Anderson-Dar 1 ing 

. 3054909 

1 . 270687 

Kui oer 

. 2920988 

1.214983 

Watson 

. 2707492 

1 . 126179 

Scale  Estimator 

MSE 

RE 

Mod  Semi -Inter quarti 1 e  Range 

. 2^2409 1 

1 . 328926 

Maximum  Likelihood 

. 2200342 

- 

Kui per 

.3248410 

1 . 476320 

Watson 

— 

— 

TABLE  1 1 -10 


MSE  and  RE 

Samole  Size:  16 
Location  Parameter 
Number  of  Samoles: 

Censored  Samples 

of  Estimators 

DSEED : 

:  0  Scale 

1 000 

4335673 
Parameter:  1 

Location  Estimator 

MSE 

RE 

Medi an 

. 1928420 

1 . 152691 

Maximum  Likelihood 

. 1672972 

- 

Kol mogr ov 

.2468639 

1 . 475600 

Cramer  von-Mises 

. 20 1 2539 

1 . 202972 

Anderson-Dar lino 

.2089179 

1 . 243783 

Kui Der 

. 1998791 

1. 194754 

Watson 

. 1795871 

1.073461 

Scale  Estimator 

MSE 

RE 

Mod  Semi -Interquarti 1 e  Range 

. 2096114 

1 . 330707 

Maximum  Likelihood 

. 1575189 

- 

Ku i o  er 

. 1906907 

1.210590 

Watson 


TABLE  1 1 - 1 1 


MSE  and  RE 

Samole  Size:  6 
Location  Parameter: 
Number  of  Samoles: 

Non-censored  Samples 

of  Estimators 

DSEED :  4385673 
-2  Scale  Parameter 

1 000 

:  5 

Location  Estimator 

MSE 

RE 

Medi an 

20393 . 40 

1.275192 

Maximum  Likelihood 

15992.41 

- 

Kcl moaorov 

25442. 00 

1 . 5Q0879 

Cramer  von-Mises 

24483.27 

1.530931 

Anderson-Dar 1 i nq 

24710. 00 

1.545108 

Ku icer 

1 6309 . 07 

1.051066 

Watson 

17421.26 

1 . 089345 

Scale  Estimator 

MSE 

RE 

Mod  Semi -Inter quarti  1  e 

Range 

21968.45 

1 . 577240 

Maximum  Likelihood 

13928.41 

- 

Kuioer 

14754.24 

1.059291 

Watson 

14727.70 

1 . 057386 

TABLE  I  1 1 


Location  Estimator 

MSE 

RE 

! 

Samole  Size 

j 

6 

. 7371190 

i 

1 . 147544 

3 

. 3979722 

1 . 017073 

10 

.2391362 

1.018591 

12 

. 2228722 

1 .010221 

16 

. 1566628 

1 . 002658 

Scale  Estimator 

MSE 

RE 

6 

. 6068876 

1 . 007577 

8 

. 41 18958 

1 . 010174 

10 

. 2377366 

1 . 003966 

12 

. 2321404 

1 . 00326 1 

16 

. 1554233 

1 . 002658 

Appendix  B 

Computer  Program  Listing  o-f  CLPEE 


********************* ************** ************ ************ 
*  * 

*  PROGRAM  * 

*  * 

*  CAUCHY  LOCATION  PARAMETER  ESTIMATION  AND  * 

*  EFFICIENCIES  (CLPEE)  * 

*  * 

*  WRITTEN  BY  CAPT  JOHN  0  SOURS,  AFIT/GSO-85D,  FOR  * 

*  MS  THESIS  * 

*  * 

*  DECEMBER  1985  * 

*  * 
*********************************************************** 


PROGRAM  CLPEE 

*********************************************************** 


PURPOSE:  TO  COMPARE  PARAMETER  ESTIMATION  TECHNIQUES  OF  * 
THE  LOCATION  PARAMETER  OF  THE  TWO  PARAMETER  * 
CAUCHY  DISTRIBUTION.  THE  TECHNIQUES  COMPARED  * 
ARE  THE  MAXIMUM  LIKELIHOOD  METHOD  AND  MINIMUM  * 
DISTANCE  METHODS.  * 


VARIABLES:  R(20)  -  ARRAY  WHICH  HOLDS  THE  CAUCHY  * 

DEVIATES  * 

EDF (20) -  ARRAY  WHICH  HOLDS  VALUES  OF  THE  * 

EMPIRICAL  DISTRIBUTION  FUNCTION  * 

PI  -  3.1415926  RADIANS  * 

N  -  SAMPLE  SIZE  * 

RN  -  REAL  VALUE  FOR  SAMPLE  SIZE  * 

DSEED  -  SEED  VALUE  USED  FOR  GENERATING  * 

CAUCHY  DEVIATES  IN  GGCAY  * 

WK ( 60 )  -  ARRAY  USED  IN  GGCAY  FOR  WORKSPACE  * 

(3  TIMES  R ( I ) )  * 

LOC  -  LOCATION  PARAMETER  VALUE  OF  THE  * 

CAUCHY  DISTRIBUTION  FROM  WHICH  THE  * 
DEVIATES  ARE  TAKEN  * 

SCALE  -  SCALE  PARAMETER  VALUE  OF  THE  * 

CAUCHY  DISTRIBUTION  FROM  WHICH  THE  * 
DEVIATES  ARE  TAKEN  * 

I FLAG  -  FLAG  USED  TO  DETERMINE  WHICH  D IS-  * 

TANCE  IS  TO  BE  USED  IN  AEMIN  * 

(CRAMER-VON  MISES. ANDERSON-DARLING, * 


SCALE  - 


I FLAG  - 
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c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 


* 

* 

* 

* 

# 

« 

DC 

DC 

Dt 

Dc 

* 

Dc 

Dc 

Dc 

Dc 

Dc 

DC 

DC 

Dc 

Dt 

Dc 

DC 

Dc 

Dc 

Dc 

Dt 

Dc 

Dc 

Dc 

Dc 

Dc 

Dt 

Dc 

Dc 

Dc 

Dc 

Dc 

Dc 

Dt 

Dt 

Dc 

Dc 

Dt 

Dc 

Dc 

Dc 

Dc 

Dc 

Dt 

Dc 

Dc 

Dc 

Dt  Dt 


SSZE 

SUMME 

SUMML 

SUMKS 

SUMCM 

SUMAD 

SUMKU 

SUMWA 

EME 

EKS 

ECM 

EAD 

EKU 

EWA 

DME 

DMEQ 

DML 

DMLQ 

DKS 

DKSQ 

DCM 


DCMQ 

DAD 


DADQ 

DKU 

DK.UQ 

DWA 

DWAQ 

MEDIAN 

SEMIQ 

MLEL 

MLES 

KSL 

CML 

ADL 

KUL 

WAL 


************** 


□R  WATSON)  Dt 

STEP  SIZE  USED  IN  THE  MINIMUM  Dc 

DISTANCE  TECHNIQUES  Dt 

SUM  OF  THE  MEAN  SQUARED  ERRORS  * 

FOR  THE  MEDIAN,  MLE  LOCATION,  DC 

KOLMOGOROV  DISTANCE,  Dt 

CRAMER  VON-MISES  DISTANCE,  ANDERSON-  Dt 
DARLING  DISTANCE, KU I PER  DISTANCE,  * 

AND  THE  WATSON  DISTANCE  Dt 

Dt 

SUMME /SUMML  * 

SUMKS /SUMML  * 

SUMCM /SUMML  Dt 

SUMAD /SUMML  Dc 

SUMKU /SUMML  Dc 

SUMWA /SUMML  * 

DIFFERENCE  BETWEEN  THE  LOC  AND  Dc 

MEDIAN  * 

DME  SQUARED  (MEAN  SQUARED  ERROR)  Dc 

DIFFERENCE  BETWEEN  THE  LOC  AND  Dt 

MLE  OF  LOCATION  Dt 

DML  SQUARED  (MEAN  SQUARED  ERROR)  Dc 

DIFFERENCE  BETWEEN  THE  LOC  AND  Dt 

KOLMOGOROV  ESTIMATE  OF  LOCATION  Dt 

DKS  SQUARED  (MEAN  SQUARED  ERROR)  Dr 

DIFFERENCE  BETWEEN  THE  LOC  AND  Dt 

CRAMER  VON-MISES  ESTIMATE  OF  Dt 

LOCATION  Dt 

DCM  SQUARED  (MEAN  SQUARED  ERROR)  Dc 

DIFFERENCE  BETWEEN  THE  LOC  AND  * 

ANDERSON-DARLING  ESTIMATE  OF  Dc 

LOCATION  Dt 

DAD  SQUARED  (MEAN  SQUARED  ERROR)  Dt 

DIFFERENCE  BETWEEN  THE  LOC  AND  * 

KUIPER  ESTIMATE  OF  LOCATION  * 

DKU  SQUARED  (MEAN  SQUARED  ERROR)  Dc 

DIFFERENCE  BETWEEN  THE  LOC  AND  Dc 

WATSON  ESTIMATE  OF  LOCATION  Dc 

DWA  SQUARED  (MEAN  SQUARED  ERROR)  Dc 

THE  MEDIAN  OF  THE  SAMPLE  Dc 

MODIFIED  SEMI -INTERQUARTILE  RANGE  Dc 

MLE  ESTIMATE  OF  LOCATION  Dc 

MLE  ESTIMATE  OF  SCALE  Dc 

KOLMOGOROV  ESTIMATE  OF  LOCATION  Dc 

CRAMER  VON-MISES  ESTIMATE  OF  * 

LOCATION  Dt 

ANDERSON-DARLING  ESTIMATE  OF  Dt 

LOCATION  Dt 

KUIPER  ESTIMATE  OF  LOCATION  Dt 

WATSON  ESTIMATE  OF  LOCATION  Dt 

Dt 

DC 


Hftt  *********************************** 


onntnnnnonnn  nn 


COMMON  R (20) . EDF (20) , PI , N, RN 

REAL  WK  <  60 ) . KSL . KUL . LOG , MLEL , MLES . MED I AN 

DOUBLE  PRECISION  DSEED 

PRINT*, 'ENTER  SAMPLE  SIZE’ 

READ* , N 

PRINT*, 'ENTER  #  REPLICATIONS’ 

READ*, NR 

PRINT*, ’ INPUT  DSEED  <0  FOR  DEFAULT  VALUE  OF  4385673)' 
READ*, DSEED 
IF  (DSEED. EQ.O)  THEN 
DSEED=4385673. 

END  IF 

PRINT*, ' INPUT  LOCATION  AND  SCALE  PARAMETER' 

READ*, LOC, SCALE 

OPEN ( UN IT=10,FI LE= ’ DATA’ , STATUS=’ NEW’ ) 

WRITEdO,*)’  ’ 

WRITE ( 10, *) 'SAMPLE  SIZE  IS  ',N. ’  LuC  IS  ’,LOC,’  SCALE  IS 
> ’ , SCALE 
WRITEdO,*)’ 

PI=3. 1415926 

SSZE=. 01 

RN=N 

SUMME=0 

SUMML=0 

SUMKS=0 

SUMCM=0 

SUMAD=0 

SUMKU=0 

SUMWA=0 

CALCULATE  EMPIRICAL  D I SR I BUT I ON  FUNCTION 

DO  6  1=1, N 

EDF ( I ) =I/RN 
CONTINUE 
DO  75  L=1 , NR 

GENERATE  CAUCHY  DEVIATES 

CALL  GGCAY  (DSEED, N, WK, R) 

DO  5  1=1, N 

R ( I ) =R ( I ) *SCALE+LOC 
CONTINUE 

ORDER  MATRIX  OF  CAUCHY  DEVIATES 

DO  25  I = 1 , N 

DO  35  J=1 , N— 1 
IF  < R  <  J ) . GT . R ( J+ 1 ) )  THEN 
TEMP=R ( J) 
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R(J)=R  (J+l  > 

R(J+1)=TEMP 
END  IF 

35  CONTINUE 

25  CONTINUE 

C 

C  COMPUTE  MEDIAN  AND  MODIFIED  SEMI - INTERQUART ILE  RANGE 
C 

CALL  MEDSEM  (MED I AN, SEMIQ) 

DME=LOC-MEDIAN 
DMEQ=DME**2 
SUMME =SUMME+DME  Q 
C 

C  CALCULATE  MLE  OF  LOCATION  AND  SCALE 
C 

CALL  CMLE  (MEDIAN, SEMIQ, MLEL, MLES) 

C 

DML=LOC-MLEL 

DMLQ=DML**2 

SUMML=SUMML+DMLQ 

C 

C  MINIMIZE  KOLMOGOROV  DISTANCE  TO  ESTIMATE  LOCATION  PARAMETER 
C 

CALL  KSMIN  (S3ZE. MLEL, MLES, KSL) 

DKS=LOC-KSL 
DKSQ=DKS**2 
SUMKS— SUMKS+DKSQ 
C 

C  MINIMIZE  CRAMER  VON-MISES  DISTANCE  TO  ESTIMATE  LOCATION 
C  PARAMETER 

C 

IFLAG=0 

CALL  AEMIN  (SSZE, MLEL, MLES, CML, IFLAG) 

DCM=LQC-CML 

DCMQ=DCM**2 

SUMCM=SUMCM+DCMQ 

C 

C  MINIMIZE  ANDERSON-DARLING  DISTANCE  TO  ESTIMATE  LOCATION 

C  PARAMETER 

C 

IFLAG=1 

CALL  AEMIN  (SSZE, MLEL, MLES, ADL, IFLAG) 

DAD=LOC-ADL 

DADQ=DAD**2 

SUMAD=SUMAD+DADQ 

C 

C  MINIMIZE  KU I PER  DISTANCE  TO  ESTIMATE  LOCATION  PARAMETER 

C 

CALL  KUMIN  (SSZE, MLEL, MLES, KUL) 

DKU=LOC— KUL 
DKUQ=DKU*!»2 
SUMKU=SUMKU+DKUQ 
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UUUU  N  <J  CJ 


MINIMIZE  THE  WATSON  DISTANCE  TO  ESTIMATE  THE  LOCATION 
PARAMETER 


IFLAG=2 

CALL  AEMIN  (SSZE, MLEL, MLES, WAL, I FLAG) 

DWA=LOC-WAL 

DWAQ=DWA**2 

SUMWA=SUMWA+DWAQ 

CONTINUE 

EME=SUMME/SUMML 

EKS*SUMKS/SUMML 

ECM=SUMCM/SUMML 

E AD=SUMAD / SUMML 

EKU=SUMKU / SUMML 

EWA=SUMW A / SUMML 


WRITE ( 
WRITE ( 
WRITE- 
WRITE ( 
WRITE ( 
WRITE ( 
WRITE ( 
WRITE ( 
WRITE ( 
WRITE ( 
WRITE ( 
WRITE ( 
WRITE < 
WRITE ( 
STOP 


10,  *> 
10,  *> 
10,  *) 
10,  *) 
10,  *) 
10,  *) 
10,*) 
10,  *) 
10,  *) 
10,  *) 
10,  *) 
10,  *) 
10,  *) 
10,  *) 


'  SUMME 
■  SUMML 
'  SUMKS 
'  SUMCM 
’  SUMAD 
’  SUMKU 
’  SUMWA 

9  9 

’EME  = 
’  EKS  = 
'  ECM  = 
’  EAD  = 
’  EKU  = 
*  EWA  = 


=  ’ , SUMME 
=  ’ , SUMML 
=  ’ , SUMKS 
=  ' , SUMCM 
=  ’ , SUMAD 
=  ’ , SUMKU 
=  ’ , SUMWA 

»  ,  EME 
’  ,  EKS 
'  .ECM 
*  ,  EAD 
'  ,  EKU 
’  ,  EWA 


END 


C 

C 

C 

C 

C 

C 

C 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 


SUBROUTINE  MEDSEM  (MEDIAN, SEMI Q> 


*********************************************************** 
*  * 

*  PURPOSE:  TO  COMPUTE  THE  MEDIAN  AND  MODIFIED  SEMI -INTER-* 

*  QUART I LE  RANGE  OF  THE  CAUCHY  RANDOM  SAMPLE  * 


* 

* 

*  VARIABLES:  IH.IL.MH 

* 

* 

* 

*  D1.D2 

* 

* 

*  SEMIQ1 

* 

* 


* 

* 

INTEGER  VARIABLES  USED  * 

IN  DETERMINING  THE  MODI-  * 

FIED  SEMI-INTERQUARTILE  * 

RANGE  * 

DIFFERENCES  BETWEEN  THE  * 

MEDIAN  AND  CAUCHY  RANDOM  * 

VARIATES  * 

VALUE  USED  IN  AVERAGING  * 

THE  RANGE  IF  THE  SAMPLE  * 


SIZE  IS  EVEN  AND  EQUAL  TO  * 


*.  % 
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*  THE  RANGE  IF  THE  SAMPLE  * 

*  SIZE  IS  ODD  * 

*  * 

*  NOTE:  UNDEFINED  VARIABLES  ARE  DEFINED  IN  MAIN  PROGRAM  * 

*  * 

*  * 


***********  **********  ******************1!*  *******  *********** 


COMMON  R ( 20 ) , EDF ( 20 ) , P I , N , RN 
REAL  MEDIAN 
MH= (N+2) /2 

MEDI AN=. 5* (R(N-MH+1)+R(MH) ) 

SEMIQ=0 

IH=N 

IL=1 

DO  1  1=1, MH 

D1  =  MEDI AN-R ( IL) 

D2  =  R(IH) -MEDIAN 

SEM I Q 1 =SEM I Q 

IF  (D1.GT.D2)  GOTO  2 

IH=IH— 1 

SEMIQ=D2 

GOTO  1 

IL=IL+1 

SEMIQ=D1 

CONTINUE 

IF  ( MOD ( N , 2 ) . EQ . 0 )  SEMIQ=. 5* (SEMIQ+SEMIQ1 > 

RETURN 

END 


SUBROUTINE  CMLE  (MEDI AN, SEMIQ, MLEL, MLES) 


*********************************************************** 


PURPOSE: 


TO 
OF 
CAUCHV 


VARIABLES: 


CALCULATE  THE  MAXIMUM  LIKELIHOOD  ESTIMATORS* 
THE  LOCATION  AND  SCALE  PARAMETERS  FROM  THE  * 
SAMPLE  * 

* 
* 

-  MAXIMUM  NUMBER  OF  ITERATIONS  ALLOWED* 


I  MAX 
ITER 
MLELT 
MLEST 


SUMO 

SUM1 

MLESSQ 

Z 


ITERATION  COUNTER 

VALUE  OF  MLE  OF  LOCATION/SCALE 

PARAMETERS  AT  THE  BEGINNING  OF  EACH 

ITERATION 

SUMMATION  VARIABLES  USED  IN  THE  MLE 
ITERATION  PROCESS 
MLES  SQUARED 

VARIABLE  USED  IN  THE  MLE  ITERATION 
PROCESS 


c 

c 

c 

c 

c 

c 

c 

c 

c 


20 


10 


99 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 


*  TMLES  -  SQUARE  ROOT  OF  THE  MLE  ESTIMATE  OF  * 

*  SCALE  * 

*  * 

*  NOTE:  UNDEFINED  VARIABLES  ARE  DEFINED  IN  MAIN  PROGRAM  * 

*  * 

*  * 

*********************************************************** 


COMMON  R ( 20  > , EDF ( 20 ) , P I , N , RN 

REAL  MLEL , MLES , MED I AN , MLELT , MLEST . MLESSQ 

MLEL=MED I AN 

MLES=SEMIQ 

IMAX=100 

ITER=0 

MLELT=MLEL 

MLEST=MLES 

SUM0=0. 

SUM 1=0. 

MLESSQ=MLES*  *2 
DO  10  I  =  l.N 
Z=MLESSQ+ (R ( I ) -MLEL) **2 
SUM0=SUM0+1 . /Z 
SUM 1 =SUM 1 +R ( I ) / Z 
CONTINUE 

TMLES=DFLOAT (N) /2/SUMO/MLES** (1.5) 

MLES=TMLES**2 

MLEL=SUM1 /SUMO 

ITER=ITER+1 

IF  (ITER.GT. IMAX)  GOTO  99 

IF  ( ABS (MLEL-MLELT )  .  GT  .  .0Ol*MLES>  GOTO  20 
IF  ( ABS ( MLES-MLEST ) . GT .  .05*MLES>  GOTO  20 

RETURN 
END 


SUBROUTINE  KSMIN  (SSZE, RMLEL, RMLES, MKSL) 


**************************************************  »  *****  ::  ‘  * 
*  * 

*  PURPOSE:  TO  PROVIDE  THE  LOCATION  ESTIMATE  THAT  MINIMIZES* 

*  THE  KOLMOGOROV  DISTANCE  * 

*  * 

-  VERTICAL  DISTANCE  FROM  THE  CDF  TO  * 

THE  EDF  ABOVE/BELOW  THE  CDF  * 

-  THE  INCREMENT  ADDED  ON  TO  OR  SUB-  * 

TRACTED  FROM  THE  PREVIOUS  ESTIMATE  * 

-  THE  NUMBER  OF  STEPS  THAT  ARE  ADDED  * 

ON  TO  OR  SUBTRACTED  FROM  THE  INITIAL* 
ESTIMATE  * 

-  KOLMOGOROV  DISTANCE  WITH  THE  MLE  * 


*  VARIABLES:  DISA 

*  DISB 

*  STEP 

* 

*  ITER 

* 

* 

*  CKSDS 


68 


c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 


u 


* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 


CKSL 

RKSDIS 

LKSDIS 

EKSL 

RKSDS 

LKSDS 

RKSL 

LKSL 

J,K 


MKSDS 

MKSL 

RMLEL 

RMLES 


ESTIMATE  AS  THE  PARAMETER  VALUE 
INITIAL  PARAMETER  VALUE 
KOLMOGOROV  DISTANCE  AT  EACH  STEP 
TO  THE  RIGHT/LEFT  OF  THE  INITIAL 
ESTIMATE 

PARAMETER  VALUE  AT  EACH  INCREMENT 
THE  MINIMUM  KOLMOGOROV  DISTANCE  TO 
THE  RIGHT/LEFT  OF  THE  INITIAL 
ESTIMATE 

PARAMETER  VALUE  THAT  YIELDS  RKSDS/ 
LKSDS 

INTEGER  VALUES  TO  DETERMINE  THE 
ITERATION  NUMBER  THAT  RKSDS /LKSDS 
OCCURS  ON 

MINIMUM  KOLMOGOROV  DISTANCE 
PARAMETER  VALUE  THAT  YIELDS  MKSDS 
MLE  ESTIMATE  OF  LOCATION  PARAMETER 
MLE  ESTIMATE  OF  SCALE  PARAMETER 


NOTE:  UNDEFINED  VARIABLES  ARE  DEFINED  IN  MAIN  PROGRAM 


* 

* 

* 

* 

* 

* 

% 

% 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 


********#:M****#***:M************************************** 


REAL  MKSDS . MKSL , LKSL , LKSDS . LKSD I S 

STEP=RMLES*SSZE 

ITER=1/SSZE 

CALL  VEDIS  (RMLEL, RMLES, DISA.DISB) 
IF  (DISA.GT.DISB)  THEN 
CKSDS=DISA 
ELSE 

CKSDS=DISB 
END  IF 
CKSL=RMLEL 
RKSDS=CKSDS 
EKSL=CKSL 
RKSL=CKSL 
J  =0 

DO  10  1=1, ITER 
EKSL=EK3L+STEP 

CALL  VEDIS  (EKSL, RMLES, DISA.DISB) 
IF  (DISA.GT.DISB)  THEN 
RKSDIS=DISA 
ELSE 

RKSDIS=DISB 
END  IF 

IF  (RKSDIS.LT. RKSDS)  THEN 
RKSDS=RKSD I S 
J=I 

RKSL=EK3L 
END  IF 
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CONTINUE 

IF  (J.EQ.ITER)  GOTO  5 

EKSL=CKSL 

LKSDS=CKSDS 

LKSL=CKSL 

K=0 

DO  20  1  =  1,1  TER 
EKSL=EKSL-STEP 

CALL  VEDIS  (EKSL, RMLES, DISA, DISB) 
IF  (DISA. GT. DISB)  THEN 
LKSDIS=DISA 
ELSE 

LKSDIS=DISB 
END  IF 

IF  (LKSDIS.LT. LKSDS)  THEN 
LKSDS=LKSD I S 
K=I 

LKSL=EKSL 
END  IF 
CONTINUE 

IF  (K.EQ.ITER)  GOTO  15 
IF  (RKSDS.LT. LKSDS)  THEN 
MKSDS=RKSDS 
MKSL=RKSL 
ELSE 

MKSDS=LKSDS 
MKSL=LKSL 
END  IF 
RETURN 
END 


SUBROUTINE  AEMIN  (SSZE, RMLEL, RMLES, MAEL, IFLAG) 


4 i  ********************** ************************************ 
*  * 

*  PURPOSE:  TO  PROVIDE  THE  LOCATION  ESTIMATE  THAT  MINIMIZES* 

*  THE  CRAMER  VON-MISES  (CVM) ,  ANDERSON-DARLING  * 

*  (AD) ,  OR  THE  WATSON  DISTANCE  * 


VARIABLES:  IFLAG  - 


STEP 


ITER 


CAEDS  - 


FLAG  USED  TO  DETERMINE  WHICH  DIS-  * 
TANCE  MEASURE  IS  TO  BE  USED  (CRAMER  * 
VON  MISES(O) , ANDERSON— DARL I NG ( 1 ) ,  * 

WATSON (2))  * 

THE  INCREMENT  ADDED  ON  TO  OR  SUB-  * 
TRACTED  FROM  THE  PREVIOUS  ESTIMATE  * 
THE  NUMBER  OF  STEPS  THAT  ARE  ADDED  * 
ON  TO  OR  SUBTRACTED  FROM  THE  INITIAL* 
ESTIMATE  * 

CVM,  AD,  OR  WATSON  DISTANCE  WITH  THE* 
MLE  ESTIMATE  AS  THE  PARAMETER  VALUE  * 
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CAEL  -  INITIAL  PARAMETER  VALUE  * 

RAEDIS  -  CVM.  AD,  OR  WATSON  DISTANCE  AT  EACH  * 
LAEDIS  STEP  TO  THE  RIGHT/LEFT  OF  THE  * 

INITIAL  ESTIMATE  * 

EAEL  -  PARAMETER  VALUE  AT  EACH  INCREMENT  * 
RAEDS  -  THE  MINIMUM  CVM,  AD,  OR  WATSON  * 

LAEDS  DISTANCE  TO  THE  RIGHT/LEFT  OF  THE  * 
INITIAL  ESTIMATE  * 

RAEL  -  PARAMETER  VALUE  THAT  YIELDS  RAEDS/  * 
LAEL  LAEDS  * 

J , K  -  INTEGER  VALUES  TO  DETERMINE  THE  * 

ITERATION  NUMBER  THAT  RAEDS/LAEDS  * 

OCCURS  ON  * 

MAEDS  -  MINIMUM  CVM,  AD,  OR  WATSON  DISTANCE  * 
MAEL  -  PARAMETER  VALUE  THAT  YIELDS  MAEDS  * 
RMLEL  -  MLE  ESTIMATE  OF  LOCATION  PARAMETER  * 
RMLES  -  MLE  ESTIMATE  OF  SCALE  PARAMETER  * 

* 

NOTE:  UNDEFINED  VARIABLES  ARE  DEFINED  IN  MAIN  PROGRAM  * 


*********************************************************** 

REAL  MAEDS , MAEL , LAEDS , LAED I S , LAEL 

STEP=RMLES*SSZE 

ITER=1/SSZE 

CALL  AED I S  ( RMLEL , RMLES , CAEDS , I FLAG ) 

CAEL=RMLEL 

RAEDS=CAEDS 

RAEL=CAEL 

EAEL=CAEL 

J=0 

DO  10  1=1, ITER 
EAEL=EAEL+STEP 

CALL  AED I S  <  EAEL , RMLES , R AED I S , I FLAG ) 

IF  (RAEDIS.LT. RAEDS)  THEN 
RAEDS=RAEDIS 
J=I 

RAEL=EAEL 
END  IF 
CONTINUE 

IF  (J.EQ.ITER)  GOTO  5 

EAEL=CAEL 

LAEL=CAEL 

LAEDS=CAEDS 

K=0 

DO  20  1=1, ITER 
EAEL=EAEL-5TEP 

CALL  AED I S  ( EAEL , RMLES , LAED I S , I FLAG ) 

IF  (LAEDIS.LT. LAEDS)  THEN 
LAEDS=LAEDIS 
K=I 

LAEL=EAEL 
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L 


31 


* 


END  IF 
CONTINUE 

IF  (K.EQ.ITER)  GOTO  15 
IF  (RAEDS.LT. LAEDS)  THEN 
MAEDS=RAEDS 
MAEL=RAEL 
ELSE 

MAEDS=LAEDS 
MAEL=LAEL 
END  IF 
RETURN 
END 


SUBROUTINE  KUMIN  (SSZE, RMLEL, RMLES, MKUL) 


*********************************************************** 
*  * 
*  PURPOSE:  TO  PROVIDE  THE  LOCATION  ESTIMATE  THAT  MINIMIZES* 


THE  KUIPER  DISTANCE 


VARIABLES: 


DISA 

DISB 

STEP 

ITER 


CKUDS  - 
CKUS 

RKUDIS  - 
LKUDIS 

EKUS 
RKUDS  - 
LKUDS 
RKUS 
LKUS 
J ,  K 


MKUDS  - 
MKUS 

RMLEL  - 
RMLES  - 


VERTICAL  DISTANCE  FROM  THE  CDF  TO  * 
THE  EDF  ABOVE / BELOW  THE  CDF  * 
THE  INCREMENT  ADDED  ON  TO  THE  * 
PREVIOUS  LOCATION  ESTIMATE  * 
THE  NUMBER  OF  STEPS  THAT  ARE  ADDED  * 
ON  TO  OR  SUBTRACTED  FROM  THE  INITIAL* 
ESTIMATE  * 
KUIPER  DISTANCE  WITH  THE  MLE  * 
ESTIMATE  AS  THE  PARAMETER  VALUE  * 
INITIAL  PARAMETER  VALUE  * 
KUIPER  DISTANCE  AT  EACH  STEP  * 
TO  THE  RIGHT/LEFT  OF  THE  INITIAL  * 
ESTIMATE  * 
PARAMETER  VALUE  AT  EACH  INCREMENT  * 
THE  MINIMUM  KUIPER  DISTANCE  TO  THE  * 
RIGHT/LEFT  THE  INITIAL  ESTIMATE  * 
PARAMETER  VALUE  THAT  YIELDS  RKUDS/  * 
LKUDS  * 
INTEGER  VALUES  TO  DETERMINE  THE  * 
ITERATION  NUMBER  THAT  RKUDS / LKUDS  * 
OCCURS  ON  * 
MINIMUM  KUIPER  DISTANCE  * 
PARAMETER  VALUE  THAT  YIELDS  MKUDS  * 
MLE  ESTIMATE  OF  LOCATION  PARAMETER  * 
(OR  OTHER  INITIAL  ESTIMATE)  * 
MLE  ESTIMATE  OF  SCALE  PARAMETER  * 
(OR  OTHER  INITIAL  ESTIMATE)  * 


NOTE:  UNDEFINED  VARIABLES  ARE  DEFINED  IN  MAIN  PROGRAM 


JL  JV  wS.  _ 
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n  n  n  n  m  r- j  cn  n  n 


* 

********************************************************** 

REAL  LKUD I S , LKUDS , LKUL , MKUDS , MKUL 

STEP=RMLES*SSZE 

ITER=1/SSZE 

CALL  VEDIS  <RMLEL, RMLES, DISA, DISB) 

CKUDS=DISA+DISB 

CKUL=RMLEL 

RKUl>S=CKUDS 

EKUL=CKUL 

J=0 

DO  10  1=1, ITER 
EKUL=EKUL+STEP 

CALL  VEDIS  (EKUL, RMLES, DISA, DISB) 

RKUD I S=D I SA+D I SB 
IF  (RKUDIS.LT. RKUDS)  THEN 
RKUDS=RKUD I S 
J=I 

RKUL=EKUL 
END  IF 

0  CONTINUE 

IF  (J.EQ. ITER)  GOTO  5 
LKUDS=CKUDS 
LKUL=CKUL 
EKUL=CKUL 
5  K=0 

DO  20  1=1, ITER 
EKUL=EKUl.-STEP 

CALL  VEDIS  (EKUL, RMLES, DISA, DISB) 

LKUD I S=D I SA+D I SB 
IF  (LKUDIS.LT. LKUDS)  THEN 
LKUDS=LKUD I S 
K=I 

LKUL = EKUL 
END  IF 

0  CONTINUE 

IF  (K.EQ.ITER)  GOTO  15 
IF  (RKUDS.LT. LKUDS)  THEN 
MKUDS=RKUDS 
MKUL=RKUL 
ELSE 

MKUDS=LKUDS 
MKUL=LKUL 
END  IF 

9  RETURN 
END 

SUBROUT I NE  VEDIS  ( ELOC , ESC ALE , D I SA , D I SB ) 


****************  ****************** *****  *  *  t  j-  *  m  y  *  *  *  *  *  *  ♦  t  *  *  t 
*  * 
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■*  * 


<y. 


c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 


10 


c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 


*  PURPOSE:  TO  CALCULATE  THE  VERTICAL  DISTANCE  ABOVE  AND  * 

*  BELOW  THE  CDF  TO  THE  EDF  * 

*  * 

*  VARIABLES:  ELOC  -  LOCATION  PARAMETER  USED  IN  CDF  * 

*  ESCALE  -  SCALE  PARAMETER  USED  IN  CDF  * 

*  CAU(I)  -  ARRAY  OF  SAMPLE  SIZE  N  USED  TO  STORE* 

*  CAUCHY  CDF  VALUES  * 

*  D1  -  VERTICAL  DISTANCE  FROM  THE  CDF  TO  * 

*  THE  EDF  WHERE  THE  CDF  VALUE  IS  * 

*  GREATER  * 

*  D2  VERTICAL  DISTANCE  FROM  THE  EDF  TO  * 

*  THE  CDF  WHERE  THE  EDF  VALUE  IS  * 

*  GREATER  * 

*  * 

*  NOTE:  UNDEFINED  VARIABLES  ARE  DEFINED  IN  MAIN  PROGRAM  * 

*  OR  HIGHER  SUBROUTINE  * 

*  * 

*  * 


*********************************************************** 


COMMON  R(20) , EDF (20) ,PI,N,RN 
REAL  CAU ( 20 ) 

DO  5  1  =  1 , N 

CAU ( I ) =. 5+ ( 1 /PI ) * (AT AN  < (R(I)-ELOC) /ESCALE) ) 
CONTINUE 
DISB=CAU ( 1 ) 

DISA=ABS (CAU ( 1 ) -EDF ( 1 ) ) 

DO  10  1=2, N 

D 1 =ABS ( CAU ( I ) —EDF ( I — 1 ) ) 

IF  (Dl.GT.DISB)  THEN 
DISB=D1 
END  IF 

D2=ABS (CAU ( I ) -EDF ( I ) > 

IF  (D2.GT.DISA)  THEN 
DISA=D2 
END  IF 
CONTINUE 
RETURN 
END 


SUBROUTINE  AEDIS  ( AEL, AES, DIS, IFLAG) 


*********************************************************** 


*  * 

*  PURPOSE:  TO  CALCULATE  THE  AREA  BETWEEN  THE  CDF  AND  THE  * 

*  EDF  * 

*  * 

*  VARIABLES:  AEL  -  LOCATION  PARAMETER  USED  IN  CDF  * 

*  AES  -  SCALE  PARAMETER  USED  IN  CDF  * 
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CAU  ( I ) 


C 

C 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 


10 


20 


30 


40 


* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 


CAU ( I )  -  ARRAY  OF  SAMPLE  SIZE  N  USED  TO  STORE* 
CAUCHY  CDF  VALUES  * 

D IS  -  AREA  BETWEEN  THE  CDF  AND  EDF 

I FLAG  -  FLAG  USED  TO  DETERMINE  WHICH  DIS¬ 
TANCE  MEASURE  IS  TO  BE  USED  (CRAMER 
VON  MISES) (O) , ANDERSON-DARLING (1) , 
WATSON (2) 

Z  -  THE  AVERAGE  VALUE  OF  THE  CAUCHY 

CUMULATIVE  PROBABILITIES  (USED  IN 
WATSON’S  DISTANCE  MEASURE) 


NOTE:  UNDEFINED  VARIABLES  ARE  DEFINED 
OR  HIGHER  SUBROUTINE 


IN  MAIN  PROGRAM 


* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 


*********************************************************** 


COMMON  R ( 20 ) , EDF ( 20 ) , P I , N , RN 
REAL  CAU (20) 

DO  5  1=1 , N 

CAU ( I ) = . 5+ (1/PI)*(AT  AN ( (R ( I ) -AEL) / AES) ) 

CONTINUE 

DIS=0 

IF  ( I FLAG. EQ. 0)  THEN 
DO  10,  1=1, N 

DIS=DIS+(CAU(I)-(2*I-1)/ (2*RN) ) **2 
CONTINUE 

DIS=DIS+1 / ( 12*RN) 

END  IF 

IF  ( I FLAG. EQ. 1 )  THEN 
DO  20,  1=1, N 

DIS=DIS— (2*1—1)* (LOG (CAU ( I ) ) +LOG ( 1-CAU (N+l-I ) ) ) 
CONTINUE 
DIS=DIS/RN— RN 
END  IF 

IF  ( I FLAG. EQ. 2)  THEN 
Z=0 

DO  30,  1=1, N 

DIS=DIS+ (CAU ( I ) - (2*1-1 ) / (2*RN) ) **2 
CONTINUE 

DIS=DIS+1 / ( 1 2*RN ) 

DO  40,  1=1, N 

Z=Z+CAU( I) /RN 
CONTINUE 

DIS=DIS-RN* (Z-. 5) *  *2 
END  IF 
RETURN 
END 
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Appendix  C 

Computer  Program  Li  sting  o-f  CSPEE 


C 

C 

C 

C 

C 

C 

C 

C 

C 

C 

C 

C 

C 

C 

C 

C 

C 

C 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 


******* * ******************** *************** **************** 


*  * 

*  PROGRAM  * 

*  * 

*  CAUCHY  SCALE  PARAMETER  ESTIMATION  AND  * 

*  EFFICIENCIES  (CSPEE)  * 

*  * 

*  WRITTEN  BY  CAPT  JOHN  0  SOURS,  AFIT/GS0-85D,  FOR  * 

*  MS  THESIS  * 

*  * 

*  DECEMBER  1985  * 

*  * 


*********************************************************** 


PROGRAM  CSPEE 


*********************************************************** 


* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 


PURPOSE:  TO  COMPARE  PARAMETER  ESTIMATION  TECHNIQUES  OF 
THE  SCALE  PARAMETER  OF  THE  TWO  PARAMETER 
CAUCHY  DISTRIBUTION.  THE  TECHNIQUES  COMPARED 
ARE  THE  MAXIMUM  LIKELIHOOD  METHOD  AND  MINIMUM 
DISTANCE  METHODS. 

VARIABLES:  R(20)  -  ARRAY  WHICH  HOLDS  THE  CAUCHY 

DEVIATES 

EDF (20) —  ARRAY  WHICH  HOLDS  VALUES  OF  THE 
EMPIRICAL  DISTRIBUTION  FUNCTION 
3. 1415926  RADIANS 
SAMPLE  SIZE 

REAL  VALUE  FOR  SAMPLE  SIZE 
SEED  VALUE  USED  FOR  GENERATING 
CAUCHY  DEVIATES  IN  GGCAY 
WK ( 60 )  -  ARRAY  USED  IN  GGCAY  FOR  WORKSPACE 
(3  TIMES  R ( I ) ) 

LOC  -  LOCATION  PARAMETER  VALUE  OF  THE 

CAUCHY  DISTRIBUTION  FROM  WHICH  THE 
DEVIATES  ARE  TAKEN 

SCALE  -  SCALE  PARAMETER  VALUE  OF  THE 

CAUCHY  DISTRIBUTION  FROM  WHICH  THE 
DEVIATES  ARE  TAKEN 

SSZE  -  STEP  SIZE  USED  IN  THE  MINIMUM 
DISTANCE  TECHNIQUES 
SUM  OF  THE  MEAN  SQUARED  ERRORS 
FOR  THE  MODIFIED  SEMI -INTERQUART  I LE 


PI 

N 

RN 

DSEED 


SUMME 

SUMMS 

SUMKU 


* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 


RANGE,  MLE  ESTIMATE  OF  SCALE,  KUIPER* 
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C 

* 

SUMWA 

AND  WATSON  DISTANCE 

* 

c 

* 

ESE 

- 

SUMSE/SUMMS 

* 

c 

* 

EKU 

- 

SUMKU/SUMMS 

* 

c 

* 

EWA 

- 

SUMWA /SUMMS 

* 

c 

* 

DSE 

- 

DIFFERENCE  BETWEEN  THE  SCALE  AND 

* 

c 

* 

MODIFIFIED  SEMI-INTERQUARTILE 

* 

c 

* 

RANGE 

* 

c 

* 

DSEQ 

- 

DSE  SQUARED  (MEAN  SQUARED  ERROR) 

* 

c 

* 

DMS 

- 

DIFFERENCE  BETWEEN  THE  SCALE  AND 

* 

c 

* 

MLE  OF  SCALE 

* 

c 

* 

DMSQ 

- 

DMS  SQUARED  (MEAN  SQUARED  ERROR) 

* 

c 

* 

LOCATION 

* 

c 

* 

DKU 

- 

DIFFERENCE  BETWEEN  THE  SCALE  AND 

* 

c 

* 

KUIPER  ESTIMATE  OF  SCALE 

* 

c 

* 

DKUQ 

- 

DKU  SQUARED  (MEAN  SQUARED  ERROR) 

* 

c 

* 

DWA 

- 

DIFFERENCE  BETWEEN  THE  SCALE  AND 

* 

c 

* 

WATSON  ESTIMATE  OF  SCALE 

* 

c 

* 

DWAQ 

- 

DWA  SQUARED  (MEAN  SQUARED  ERROR) 

* 

c 

* 

MEDIAN 

- 

THE  MEDIAN  OF  THE  SAMPLE 

* 

c 

* 

SEMIQ 

- 

MODIFIED  SEMI -INTERQUARTILE  RANGE 

* 

c 

* 

MLEL 

- 

MLE  ESTIMATE  OF  LOCATION 

* 

c 

* 

MLES 

- 

MLE  ESTIMATE  OF  SCALE 

* 

c 

* 

KUL 

- 

KUIPER  ESTIMATE  OF  SCALE 

* 

c 

* 

WAL 

- 

WATSON  ESTIMATE  OF  SCALE 

* 

c 

* 

* 

c 

* 

* 

C  *********************************************************** 

C 

C 

,  COMMON  R (20) , EDF (20) , PI , N, RN 

REAL  WK ( 60 ) . KUD I S , KUS , LOC , MLEL , MLES .MEDIAN 
DOUBLE  PRECISION  DSEED 
PRINT*, ’ENTER  SAMPLE  SIZE’ 

READ* , N 

PRINT*, ’ENTER  #  REPLICATIONS’ 

READ*, NR 

PRINT*, ’ INPUT  DSEED  (0  FOR  DEFAULT  VALUE  OF  4385673)’ 

READ*. DSEED 
IF  (DSEED. EQ.O)  THEN 
DSEED=4385673. 

END  IF 

PRINT*, ’ INPUT  LOCATION  AND  SCALE  PARAMETER’ 

READ*, LOC, SCALE 

OPEN <UNIT=10,FILE=’ DATA’ , STATUS=’ NEW’ ) 

WRITEdO,*)’  ’ 

WRITE ( 10, *)’ SAMPLE  SIZE  IS  ’,N,’  LOC  IS  ’,LOC.’  SCALE  IS 
’ , SCALE 

WRITEdO,*)' 

FT  =3. 1415926 
SSZE=. 01 
RN=N 
SUMSE=0 


rV-. 


’■■•v  - 
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SUMMS=0 

SUMKU=0 

SUMWA=0 


c 

C  CALCULATE  EMPIRICAL  D I SR I BUT I ON  FUNCTION 
C 

DO  6  1=1, N 

EDF(I)=I/RN 
6  CONTINUE 

DO  75  L=1 , NR 
C 

C  GENERATE  CAUCHY  DEVIATES 

C 

CALL  GGCAY  (DSEED, N, WK, R) 

DO  5  1=1, N 

R  < I ) =R ( I ) *SCALE+LOC 
5  CONTINUE 

C 

C  ORDER  MATRIX  OF  CAUCHY  DEVIATES 

C 

DO  25  1=1, N 

DO  35  J=1 , N-l 
IF  (R ( J ) . GT. R ( J+l ) )  THEN 
TEMP=R  < J ) 

R(J>=R(J+1) 

R ( J+l ) =TEMP 
END  IF 

35  CONTINUE 

25  CONTINUE 

C 

C  COMPUTE  MEDIAN  AND  MODIFIED  SEMI -INTERQUARTILE  RANGE 
C 

CALL  MEDSEM  <  MED I AN , SEM I Q ) 

DSE=SCALE-SEM I Q 
DSEQ=DSE**2 
SUMSE=SUMSE+DSEQ 
C 

C  CALCULATE  MLE  OF  LOCATION  AND  SCALE 
C 

CALL  CMLE  (MEDIAN, SEM IQ, MLEL, MLES) 

C 

DMS=SCALE-MLES 

DMSQ=DMS**2 

SUMMS=SUMMS+DMSQ 

C 

C  MINIMIZE  KUIPER  DISTANCE  TO  ESTIMATE  SCALE  PARAMETER 
C 

CALL  KUMIN  (SSZE. MLEL, MLES, KUS) 

DKU=SCALE— KUS 
DKUQ=DKU**2 
SUMKU=SUMKU+DKUQ 
C 
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V*  V'  - 


.  -  .■  L  + 

l-vaVa 


MINIMIZE  THE  WATSON  DIS  ANCE  TO  ESTIMATE  THE  SCALE  PARAMETER 


CALL  WAMIN  (SSZE, MLEL, MLES, WAS) 

DWA=SCALE— WAS 

DWAQ=DWA**2 

SUMWA=SUMWA+DWAO 

CONTINUE 

ESE=SUMSE/SUMMS 

EKU=SUMKU/SUMMS 

EWA=SUMWA/SUMMS 

WRITE ( 10, *> ’SUMSE  =  ’,SUMSE 

WRITEdO,  *> ’SUMMS  -  SUMMS 

WRITEdO,  *) ’SUMKU  =  ’ , SUMKU 

WRITE (10, *> ’SUMWA  -  SUMWA 

WRITEdO,*)’  ’ 

WRITE (10, *) ’ESE  =  ’ , ESE 
WRITEdO,  *)  ’EKU  =  ’  ,  EKU 
WRITEdO,  *) ’EWA  ®  ’  ,  EWA 
STOP 
END 


SUBROUTINE  MEDSEM  (MEDIAN, SEMIQ) 

*************** ********************************************* 
*  * 

*  PURPOSE:  TO  COMPUTE  THE  MEDIAN  AND  MODIFIED  SEMI -INTER-* 

*  QUART I LE  RANGE  OF  THE  CAUCHY  RANDOM  SAMPLE  * 


VARIABLES:  IH, IL, MH 


D 1 ,  D2 


SEMIQ1 


INTEGER  VARIABLES  USED 
IN  DETERMINING  THE  MODI¬ 
FIED  SEMI -INTERQUARTILE 
RANGE 

DIFFERENCES  BETWEEN  THE 
MEDIAN  AND  CAUCHY  RANDOM 
VARIATES 

VALUE  USED  IN  AVERAGING 
THE  RANGE  IF  THE  SAMPLE 
SIZE  IS  EVEN  AND  EQUAL  TO 
THE  RANGE  IF  THE  SAMPLE 
SIZE  IS  ODD 


NOTE:  UNDEFINED  VARIABLES 


DEFINED  IN  MAIN  PROGRAM 


************************************ *********************** 


COMMON  R ( 20 ) , EDF ( 20 ) , P I , N , RN 
REAL  MEDIAN 
MH= (N+2) /2 


V" 

>'  v'.v 

>:  v.v 


m 


MEDIAN=.5* ( R ( N— MH+ 1 ) +R  < MH ) ) 

SEMIQ=0 
IH=N 
IL  =1 

DO  1  1=1, MH 

D1  =  MEDIAN-R(IL) 

D2  =  R(IH) -MEDIAN 

SEM I Q 1 =SEM I Q 

IF  (D1.GT.D2)  GOTO  2 

I H= I H- 1 

SEMIQ=D2 

GOTO  1 

IL=IL+1 

SEMIQ=D1 

CONTINUE 

IF  (M0D(N,2) . EQ.O)  SEMIQ=. 5* (SEMIQ+SEMIQl ) 

RETURN 

END 


SUBROUTINE  CMLE  (MEDIAN, SEMIQ, MLEL, MLES) 


*********************************************************** 
*  * 

*  PURPOSE:  TO  CALCULATE  THE  MAXIMUM  LIKELIHOOD  ESTIMATORS* 

*  OF  THE  LOCATION  AND  SCALE  PARAMETERS  FROM  THE  * 

*  CAUCHY  SAMPLE  * 

*  * 

*  * 

*  VARIABLES:  IMAX  -  MAXIMUM  NUMBER  OF  ITERATIONS  ALLOWED* 


*  VARIABLES:  IMAX 

*  ITER 

*  MLELT 

*  MLEST 


SUMO 

SUM1 

MLESSQ 

Z 


TMLES  - 


-  ITERATION  COUNTER  * 

-  VALUE  OF  MLE  OF  LOCATION/SCALE  * 

PARAMETERS  AT  THE  BEGINNING  OF  EACH  * 
ITERATION  * 

-  SUMMATION  VARIABLES  USED  IN  THE  MLE  * 

ITERATION  PROCESS  * 

-  MLES  SQUARED  * 

-  VARIABLE  USED  IN  THE  MLE  ITERATION  * 

PROCESS  * 

-  SQUARE  ROOT  OF  THE  MLE  ESTIMATE  OF  * 

SCALE  * 


NOTE:  UNDEFINED  VARIABLES  ARE  DEFINED  IN  MAIN  PROGRAM 


********************* ******************************* * ****** 


COMMON  R ( 20  > , EDF ( 20 ) , P I , N , RN 

REAL  MLEL , MLES , MED I AN , MLELT , MLEST , MLESSQ 

MLEL=MEDI AN 


20 


10 


99 

C 

C 

C 

C 

C 

C 

C 

C 

C 

C 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 


MLES=SEMIQ 

IMAX=100 

ITER=0 

MLELT=MLEL 

MLEST=MLES 

SUM0=0. 

SUM 1=0. 

MLESSQ=MLES**2 
DO  10  I  =  l.N 
Z=MLESSQ+  <R  ( I ) -MLEL ) **2 
SUM0=SUM0+ 1 . / Z 
SUM 1 =SUM 1 +R ( I ) / Z 
CONTINUE 

TMLES=DFLOAT (N) /2/SUMO/MLES** (1.5) 

MLES=TMLES**2 

MLEL=SUM 1 /SUMO 

ITER=ITER+1 

IF  (ITER.6T. IMAX)  GOTO  99 

IF  ( ABS ( MLEL— MLELT > . GT .  .001*MLES>  GOTO  20 
IF  < ABS (MLES— MLEST) .GT.  .05*MLES>  GOTO  20 
RETURN 
END 


SUBROUTINE  KUMIN  (SSZE, RMLEL, RMLES, MKUS) 


*********************** **************** ******************** 


* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 


PURPOSE:  TO  PROVIDE  THE  SCALE  ESTIMATE  THAT  MINIMIZES 
THE  KU I PER  DISTANCE 


VARIABLES: 


DISA 

DISB 

STEP1  - 
STEP2  - 
ITER 


CKUDS  - 
CKUS 

AKUDIS  - 

BKUDIS 

EKUS 

AKUDS  - 

BKUDS 

AKUS 

BKUS 

J,K 


VERTICAL  DISTANCE  FROM  THE  CDF  TO 
THE  EDF  ABOVE/BELOW  THE  CDF 
THE  INCREMENT  ADDED  ON  TO  THE 
PREVIOUS  SCALE  ESTIMATE 
THE  INCREMENT  SUBTRACTED  FROM  THE 
PREVIOUS  SCALE  ESTIMATE 
THE  NUMBER  OF  STEPS  THAT  ARE  ADDED 
ON  TO  OR  SUBTRACTED  FROM  THE  INITIAL* 
ESTIMATE  * 

KUIPER  DISTANCE  WITH  THE  MLE 
ESTIMATE  AS  THE  PARAMETER  VALUE 
INITIAL  PARAMETER  VALUE 
KUIPER  DISTANCE  AT  EACH  STEP 
ABOVE/BELOW  THE  INITIAL  ESTIMATE 
PARAMETER  VALUE  AT  EACH  INCREMENT 
THE  MINIMUM  KUIPER  DISTANCE 
ABOVE/BELOW  THE  INITIAL  ESTIMATE 
PARAMETER  VALUE  THAT  YIELDS  AKUDS/ 
BKUDS 

INTEGER  VALUES  TO  DETERMINE  THE 
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c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 


10 


15 


*  ITERATION  NUMBER  THAT  AKUDS/BKUDS  * 

*  OCCURS  ON  * 

*  MKUDS  -  MINIMUM  KUIPER  DISTANCE  * 

*  MKUS  -  PARAMETER  VALUE  THAT  YIELDS  MKUDS  * 

*  RMLEL  -  MLE  ESTIMATE  OF  LOCATION  PARAMETER  * 

*  < OR  OTHER  INITIAL  ESTIMATE)  * 

*  RMLES  -  MLE  ESTIMATE  OF  SCALE  PARAMETER  * 

*  (OR  OTHER  INITIAL  ESTIMATE)  * 

*  * 

*  NOTE:  UNDEFINED  VARIABLES  ARE  DEFINED  IN  MAIN  PROGRAM  * 

*  * 

*  * 


*********************** ************ ************************ 


REAL  MKUDS, MKUS 
STEP1=RMLES*SSZE 
STEP2=. 5*RMLES*SSZE 
ITER=1/SSZE 

CALL  VEDIS  (RMLEL. RMLES. DISA, DISB) 

CKUDS=D I SA+D I SB 

CKUS=RMLES 

AKUDS=CKUDS 

EKUS=CKUS 

J  =0 

DO  10  1=1, ITER 
EKUS=£KUS+STEP1 

CALL  VEDIS  (RMLEL. EKUS, DISA. DISB) 
AKUD I S=D I SA+D I SB 
IF  (AKUDIS.LT. AKUDS)  THEN 
AKUDS=AKUDIS 
J  =  I 

AKUS=EKUS 
END  IF 
CONTINUE 

IF  (J.EQ.ITER)  GOTO  5 

BKUDS=CKUDS 

BKUS=CKUS 

EKUS=CKUS 

K=0 

DO  20  1=1, ITER 
EKUS=EKUS-STEP2 
IF  (EKUS. LE. O)  THEN 
GOTO  20 
END  IF 

CALL  VEDIS  (RMLEL, EKUS, DISA. DISB) 
BKUDIS=DISA+DISB 
I F  ( BKUD I S . LT . BKUDS )  THEN 
BKUDS=BKUD I S 
K=I 

BKUS=EKUS 
END  IF 


n  n 


20  CONTINUE 

IF  (K.EQ.ITER)  GOTO  15 
IF  (AKUDS.LT. BKUDS)  THEN 
MKUDS=AKUDS 
MKUS=AKUS 
ELSE 

MKUDS=BKUDS 
MKUS=BKUS 
END  IF 

89  RETURN 
END 


SUBROUTINE  WAMIN  (SSZE, RMLEL, RMLES, MWAS) 


*********************************************************** 
*  * 

*  PURPOSE:  TO  PROVIDE  THE  SCALE  ESTIMATE  THAT  MINIMIZES  * 

*  THE  WATSON  DISTANCE  * 


VARIABLES:  STEP1  - 


STEP2  - 


ITER 


CWADS  - 
CWAL 

AWADIS  - 

BWADIS 

EWAL 

AWADS  - 

BWADS 

AWAL 

BWAL 

J,K 


MWADS 

MWAL 

RMLEL 

RMLES 


THE  INCREMENT  ADDED  ON  TO  THE  * 

PREVIOUS  SCALE  ESTIMATE  * 

THE  INCREMENT  SUBTRACTED  FROM  THE  * 

PREVIOUS  SCALE  ESTIMATE  * 

THE  NUMBER  OF  STEPS  THAT  ARE  ADDED  * 

ON  TO  OR  SUBTRACTED  FROM  THE  INITIAL* 
ESTIMATE  * 

WATSON  DISTANCE  WITH  THE  MLE  * 

ESTIMATE, AS  THE  PARAMETER  VALUE  * 

INITIAL  PARAMETER  VALUE  * 

WATSON  DISTANCE  AT  EACH  STEP  * 

ABOVE/ BELOW  THE  INITIAL  ESTIMATE  * 

PARAMETER  VALUE  AT  EACH  INCREMENT  * 

THE  MINIMUM  WATSON  DISTANCE  * 

ABOVE/ BELOW  THE  INITIAL  ESTIMATE  * 

PARAMETER  VALUE  THAT  YIELDS  AWADS/  * 

BWADS  * 

INTEGER  VALUES  TO  DETERMINE  THE  * 

ITERATION  NUMBER  THAT  AWADS/BWADS  * 

OCCURS  ON  * 

MINIMUM  WATSON  DISTANCE  * 

PARAMETER  VALUE  THAT  YIELDS  MWADS  * 

MLE  ESTIMATE  OF  LOCATION  PARAMETER  * 

(OR  OTHER  INITIAL  ESTIMATE)  * 

-  MLE  ESTIMATE  OF  SCALE  PARAMETER  * 

(OR  OTHER  INITIAL  ESTIMATE)  * 


NOTE:  UNDEFINED  VARIABLES  ARE  DEFINED  IN  MAIN  PROGRAM 


*********************************************************** 
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n  n  co  r  j  *-*  *-  ui  n  n 


REAL  MWADS, MWAS 
STEP 1 =RMLES*SSZE 
STEP2= . 5*RMLES*SSZE 
ITER=1/SSZE 

CALL  AEDIS  (RMLEL, RMLES, CWADS, IFLAG) 

CWAS=RMLES 

AWADS=CWADS 

AWAS=CWAS 

EWAS=CWAS 

J=0 

DO  10  1=1, ITER 
EWAS=EWAS+STEP1 

CALL  AED I S  < RMLEL , E W AS , AWAD IS, IFLAG) 
IF  (AWADIS.LT. AWADS)  THEN 
AWADS= AW AD I S 
J=I 

AWAS=EWAS 
END  IF 

0  CONTINUE 

IF  (J.EQ.ITER)  GOTO  5 
EWAS=CWAS 
BWAS=CWAS 
BWADS=CWADS 
5  K=0 

DO  20  1=1, ITER 
EWAS=EWAS-STEP2 
IF  (EWAS.LE.O)  THEN 
GOTO  20 
END  IF 

CALL  AEDIS  (RMLEL, EWAS, BWAD IS, IFLAG) 
IF  (BWADIS.LT. BWADS)  THEN 
BWADS=BWAD I S 
K=I 

BWAS=EWAS 
END  IF 

0  CONTINUE 

IF  (K.EQ. ITER)  GOTO  15 
IF  (AWADS.LT. BWADS)  THEN 
MWADS=AWADS 
MWAS=AWAS 
ELSE 

MWADS=BWADS 
MWAS=BWAS 
END  IF 

9  RETURN 
END 


SUBROUTINE  VEDIS  (ELOC, ESCALE, DISA, DISB) 
C 
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*3| t********************************************************* 


*  * 

*  PURPOSE:  TO  CALCULATE  THE  VERTICAL  DISTANCE  ABOVE  AND  * 

*  BELOW  THE  CDF  TO  THE  EDF  * 

*  * 

*  VARIABLES:  ELOC  -  LOCATION  PARAMETER  USED  IN  CDF  * 

*  ESCALE  -  SCALE  PARAMETER  USED  IN  CDF  * 

*  CAU(I)  -  ARRAY  OF  SAMPLE  SIZE  N  USED  TO  STORE* 

*  CAUCHY  CDF  VALUES  * 

*  D1  VERTICAL  DISTANCE  FROM  THE  CDF  TO  * 

*  THE  EDF  WHERE  THE  CDF  VALUE  IS  * 

*  GREATER  * 

*  D2  -  VERTICAL  DISTANCE  FROM  THE  EDF  TO  * 

*  THE  CDF  WHERE  THE  EDF  VALUE  IS  * 

*  GREATER  * 

*  * 

*  NOTE:  UNDEFINED  VARIABLES  ARE  DEFINED  IN  MAIN  PROGRAM  * 

*  ORIGHER  SUBROUTINE  * 

*  * 

*  * 


*********************************************************** 


COMMON  R<20) , EDF (20) ,PI,N,RN 
REAL  CAU ( 20 ) 

DO  5  1=1, N 

CAU  (D=.5+(l  /PI  >  *  ( ATAN  (  (R  ( I )  -ELDC  >  /ESCALE)  ) 
CONTINUE 
DISB=CAU(1) 

DISA=ABS (CAU ( 1 ) -EDF ( 1 ) ) 

DO  10  1=2. N 

D 1 = ABS  <  CAU ( I ) —EDF ( I  —  1 ) ) 

IF  (Dl.GT.DISB)  THEN 
DISB=D1 
END  IF 

D2=ABS (CAU ( I ) -EDF ( I ) ) 

IF  (D2.GT.DISA)  THEN 
DISA=D2 
END  IF 
CONT I NUE 
RETURN 
END 


SUBROUTINE  AEDIS  < AEL, AES, DIS> 


*********************************************************** 

«  * 

*  PURPOSE:  TO  CALCULATE  THE  AREA  BETWEEN  THE  CDF  AND  THE  * 


c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 


30 


40 


*  * 

*  VARIABLES:  AEL  -  LOCATION  PARAMETER  USED  IN  CDF  * 

*  AES  -  SCALE  PARAMETER  USED  IN  CDF  * 

*  CAU(I)  -  ARRAY  OF  SAMPLE  SIZE  N  USED  TO  STORE* 

*  CAUCHY  CDF  VALUES  * 

*  DIS  AREA  BETWEEN  THE  CDF  AND  EDF  * 

*  Z  THE  AVERAGE  VALUE  OF  THE  CAUCHY  * 

*  CUMULATIVE  PROBABILITIES  (USED  IN  * 

*  WATSON’S  DISTANCE  MEASURE)  * 

*  * 

*  NOTE:  UNDEFINED  VARIABLES  ARE  DEFINED  IN  MAIN  PROGRAM  * 

*  OR  HIGHER  SUBROUTINE  * 

*  * 

*  * 


*********************************************************** 


COMMON  R ( 20 ) , EDF ( 20 ) , P I , N , RN 
REAL  C AU ( 20 ) 

DO  5  1=1, N 

CAU ( I )  =  . 5+ ( 1 /PI ) * (AT AN ( (R ( I ) -AEL) / AES) ) 
CONTINUE 
D I  S=0 
Z=0 

DO  30,  1=1, N 

DIS=DIS+ (CAU ( I ) - <2*1—1 ) / (2*RN) )**2 
CONTINUE 

DIS=DIS+1 / ( 12*RN) 

DO  40,  1=1, N 

Z=Z+CAU(I)/RN 

CONTINUE 

DIS=DIS-RN* (Z-. 5) **2 
RETURN 
END 
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Appendix  D 


C 

C 

C 

C 

C 

c 

c 

c 

c 

c 

c 

c 

c 


Computer  Subprogram  Listing  of  VEDIS 
and  a£d!S  for  Censored Samples 

************************* ********************************** 


*  * 

*  SUBROUTINES  * 

*  * 

*  VEDIS  AND  AEDIS  FOR  CENSORED  SAMPLES  * 

*  * 

*  WRITTEN  BY  CAPT  JOHN  0  SOURS,  AFIT/GS0-85D,  FOR  * 

*  MS  THESIS  * 

*  * 

*  DECEMBER  1985  * 

*  * 


***************************************************** ****** 


C 

C 

C 

C 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 


SUBROUTINE  VEDIS  (ELOC, ESCALE, DISA, DISB) 


*********************************************************** 


*  * 

*  PURPOSE:  TO  CALCULATE  THE  VERTICAL  DISTANCE  ABOVE  AND  * 

*  BELOW  THE  CDF  TO  THE  EDF  FOR  CENSORED  SAMPLES  * 

*  * 

*  VARIABLES:  ELOC  -  LOCATION  PARAMETER  USED  IN  CDF  * 

*  ESCALE  -  SCALE  PARAMETER  USED  IN  CDF  * 

*  CAU(I)  -  ARRAY  OF  SAMPLE  SIZE  N  USED  TO  STORE* 

*  CAUCHY  CDF  VALUES  * 

*  D1  —  VERTICAL  DISTANCE  FROM  THE  CDF  TO  * 

*  THE  EDF  WHERE  THE  CDF  VALUE  IS  * 

*  GREATER  * 

*  D2  VERTICAL  DISTANCE  FROM  THE  EDF  TO  * 

*  THE  CDF  WHERE  THE  EDF  VALUE  IS  * 

*  GREATER  * 

*  * 

*  NOTE:  UNDEFINED  VARIABLES  ARE  DEFINED  IN  MAIN  PROGRAM  * 

*  OR  HIGHER  SUBROUTINE  * 

*  * 

*  * 


*********************************************************** 


COMMON  R(20) , EDF (20) , PI ,N,RN 
REAL  CAU (20) 

DO  5  1=3, N-2 

CAU  < I ) =, 5+ ( 1/PI ) * (ATAN ( (R( I ) -ELOC) /ESCALE) ) 
CONTINUE 

DISB=ABS (CAU (3> -EDF (2) ) 

DISA=ABS (CAU (3) -EDF (3) ) 
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2/fc 

NL 


DO  10  1=4. N-2 

D 1 = ABS ( C AU ( I ) -EDF ( I - 1 > ) 
IF  (Dl.GT.DISB)  THEN 
DISB=D1 
END  IF 

D2=ABS (CAU  ( I ) -EDF  < I ) ) 

IF  (D2.GT.DISA)  THEN 
DISA=D2 
END  IF 
CONTINUE 
RETURN 
END 


SUBROUTINE  AEDIS  ( AEL , AES , D I S , I FLAG ) 


* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 


PURPOSE:  TO  CALCULATE  THE  AREA  BETWEEN  THE  CDF  AND  THE 
EDF  FOR  CENSORED  SAMPLES 


VARIABLES: 


AEL 
AES 
CAU ( I ) 

DIS 
I  FLAG 


LOCATION  PARAMETER  USED  IN  CDF 
SCALE  PARAMETER  USED  IN  CDF 


* 

* 

* 

* 

* 

* 


-  ARRAY  OF  SAMPLE  SIZE  N  USED  TO  STORE* 


NOTE: 


CAUCHY  CDF  VALUES 
AREA  BETWEEN  THE  CDF  AND  EDF 
FLAG  USED  TO  DETERMINE  WHICH  DIS¬ 
TANCE  MEASURE  IS  TO  BE  USED  (CRAMER 
VON  MISES) (0)  , ANDERSON- DARLING ( 1 )  , 
WATSON (2) 

THE  AVERAGE  VALUE  OF  THE  CAUCHY 
CUMULATIVE  PROBABILITIES  (USED  IN 
WATSON’S  DISTANCE  MEASURE) 


UNDEFINED  VARIABLES  ARE  DEFINED  IN  MAIN  PROGRAM 
OR  HIGHER  SUBROUTINE 


COMMON  R ( 20 ) , EDF ( 20 ) , P I , N , RN 
REAL  CAU (20) 

DO  5  1=3, N-2 

CAU ( I ) =. 5+ ( 1 /PI ) * (AT AN ( (R ( I > -AEL ) / AES) ) 

CONTINUE 

DIS=0 

IF  ( I FLAG. EQ. O)  THEN 
DO  10,  1=3, N-3 

DIS=DI3+(I**2/RN**2) * (CAU( 1+1 ) -CAU ( I ) ) - ( 1/ (RN) ) * (CAU (1+1 ) 
**2-CAU ( I ) **2>  +  ( 1/3. >  * (CAU ( 1  +  1 ) **3-CAU( I) **3) 


CONTINUE 
END  IF 

IF  ( I FLAG. EQ. 1 )  THEN 
DO  20,  1=3, N-3 

DIS=DIS+( I**2/RN#*2>  *L0G(CAU(I+1 )  /  ( 1-CAU  ( 1  +  1 >  > )  + (I*2/RN)  * 

)  LOG ( 1-CAU ( 1+1 > ) -CAU ( 1  +  1  > -LOG ( 1-CAU ( 1  +  1 ) ) - ( I **2/RN**2> *LOG 
)  ( CAU ( I ) / ( 1 -CAU  < I ) ) >-<I*2/RN)*L0G(l-CAU(I) >+CAU(I) 

) +LOG ( 1 -CAU ( I > ) 

CONTINUE 
END  IF 

IF  ( I FLAG. EQ. 2)  THEN 
Z=0 

DO  30,  1=3, N-3 

DIS=DIS+ ( I**2/RN**2> * (CAU ( 1  +  1 ) -CAU  ( I ) ) - < 1/ (RN) ) * (CAU ( 1  +  1 ) 
)  **2-CAU ( I ) **2>  +  ( 1 /3. ) * (CAU ( 1  +  1 >  **3-CAU ( I ) **3) 

CONTINUE 
DO  40,  1=3, N-3 

Z=Z+ ( I/RN) * (CAU ( 1  +  1 ) -CAU ( I ) ) -. 5# (CAU ( 1  +  1 ) **2-CAU ( I >  **2> 
CONTINUE 
Z=Z**2 
DIS=DIS— Z 
END  IF 
RETURN 
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Abstract 


Minimum  distance  estimators  are  compared  to  the  maximum  likelihood  estimator  of  the 
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Comparison  of  the  two  methods  is  done  by  comparing  the  relative  effici  cies.  Both 
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